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Chapter 1 


Introduction 


1.1. Prologue and Preview 


With renormalized perturbation theory we are not dealing with 
an ordinary power series, but with a more subtle mathematical 
structure. A renormalized series, such as 


R =a(l +ria+rza? +---), (1.1) 


is an expansion in a parameter a that is indefinite, in that there are 
” 
? 


infinitely many “a’s,” all fundamentally on an equal footing. They 


are related by transformations of the form 
a’ = a(1+uja+ vga? +--+) (1.2) 


with arbitrary finite coefficients v1, v2,.... The series coefficients are 
correspondingly indefinite; they transform as 


/ 
Ty = ri = UI, 


r} = ra — 2ryv, + wW? — v9, (1.3) 


so that R 
R =d (1 +rla pre” +s") (1.4) 
remains invariant. 
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The invariance of R is called “renormalization-group (RG) 
invariance.” The essential point is that renormalization — the 
procedure for handling ultraviolet divergences in quantum field 
theory (QFT) — introduces a “renormalized couplant” a, whose 
precise definition — the “renormalization scheme” (RS) — involves 
a great deal of arbitrariness. Changing from one RS to another 
induces transformations of the form (1.2). However, any physically 
measurable quantity R must be RG invariant: that is, independent 
of the RS used. 

By itself, the mathematical structure just described is too 
“floppy” to be of much interest. It is the physical consideration of 
Dimensional Analysis that gives the problem a backbone. Any RS 
involves choosing a “renormalization scale” u, which is an arbitrary 
parameter with dimensions of mass. The dependence of a on p is 
given by the famous 8 function: 


da 


Mn = b(a) = —ba? (1 + ca + c2a? + cza? +---). (1.5) 


The £ function plays a key role and is needed to connect a to the free 
parameter of the theory (a suitably defined A parameter). The first 
two coefficients, b and c, are numbers that can be unambiguously 
calculated in any given QFT. It is straightforward to show (see 
Sec. 6.2) that both are invariant under the transformations of 
Eq. (1.2). The higher coefficients c2,c3,... can also be calculated, 
but their values depend on the RS choice. The coefficients r1,r2,... 
in the perturbative expansion of any specific physical quantity R can 
also be calculated (in the same RS). Although the r;’s and the c;’s 
are separately RS dependent, there are certain combinations of them 
that are RS invariants. By forming these invariants one distils the 
RG-invariant, physical content of the Feynman-diagram calculations 
from the scheme artefacts. 

Beyond the formal level one must define, at a given order, the 
analog of the “partial sum.” In a given RS that involves truncating 
both the R and £ series at the same order. However, the approximant 
thus defined is RS dependent, creating the notorious RS-dependence 
problem. Standard QFT textbooks typically mention the issue, and 
then mostly ignore it. Yet, without a resolution of the problem, the 
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results of perturbative QFT will remain unclear and ambiguous. 
(The complacent assumption that the difficulty becomes less and 
less severe at higher orders is unsafe because the fixed-scheme series 
are divergent.) The problem, it should be stressed, is not with the 
theory itself, but arises from the approximation. 

The approach in this book is founded on the Principle of Minimal 
Sensitivity, which is a philosophy that applies quite generally to any 
“non-invariant approximation.” In such situations the exact result 
is known to be independent of certain “extraneous parameters,” yet 
the approximate result depends upon them. The idea is to choose 
the values of the extraneous parameters so that the approximant 
is minimally sensitive to small variations of those values. Thus, an 
exact symmetry of the exact result is mimicked by a local symmetry 
of the approximate result. Applied to RS dependence, this principle 
identifies an “optimal” result that is determined, for a given physical 
quantity at a given order, by a set of “optimization equations.” From 
those equations we can solve for the optimized r; coefficients in terms 
of a and the c; coefficients. If we know, from Feynman diagram 
calculations, the numerical values of the invariants to that order, 
then we may obtain the optimized result for R numerically, by a 
suitable iterative procedure. 

Optimization has interesting consequences, particularly for the 
infrared limit, and also for the question of high-order behaviour, 
where we conjecture that optimization may produce a convergent 
sequence of approximations, despite the generic divergence of QFT 
series in any fixed RS. 


1.2. Plan of the Book 


The core of the book is Part II, a thorough exposition of “optimized 
perturbation theory” (OPT). The impatient reader, already familiar 
with QFT and renormalization, may wish to begin there. However, 
the chapters making up Part I provide background material intended 
to set the RS problem in context and to explain the thinking behind 
this approach. It is hoped that students of QFT will find these 
chapters helpful in clarifying some key conceptual issues in the RG 
and renormalized perturbation theory; issues that can sometimes 
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be obscured by technical details in textbooks. The chapters on 
non-invariant approximations and induced convergence are relevant 
to a huge variety of problems in all areas of physics and applied 
mathematics. 


Parenthetical remarks, such as this one, are interspersed in the 
text in just this format. They may often be ignored without affect- 
ing the reader’s understanding of the main line of argument. They 
discuss more technical issues, or address questions, complications, 
or apparent difficulties that might occur to some readers at that 
point. 


Chapter 2 discusses Dimensional Analysis in QFT, which is the 
key to understanding how a seemingly dimensionless bare coupling 
constant turns into an “effective” or “running” coupling constant 
that varies with energy. It also explains much about why RG 
equations arise. 

Chapter 3 discusses renormalization in conceptual terms, sup- 
pressing all technical details. It emphasizes that coupling-constant 
renormalization, for physical quantities, is essentially a substitution, 
a reparametrization of the theory. The ill-defined (infinitesimal) bare 
coupling constant is eliminated in favour of a finite renormalized 
couplant, thereby re-organizing the perturbation series into a usable 
form. However, the freedom inherent in defining a renormalized 
couplant leads, at any finite order, to the RS ambiguity. 

This RS-dependence problem arises only because we truncate 
the perturbation series. It is not fundamentally a QFT issue, but 
rather is a problem in “approximology” — the theory and lore 
of approximations. We therefore open Chapter 4 with a general 
discussion of approximations, stressing that they involve taking a 
gamble based on incomplete information. A good gambler, however, 
always makes wise use of all available information. For “non-invariant 
approximations,” the vital extra information is the knowledge that 
the exact result does not depend on certain variables. The Principle 
of Minimal Sensitivity, which makes use of this information, is 
explained and applied to several examples, in particular the Caswell- 
Killingbeck (CK) expansion for the anharmonic oscillator in quantum 
mechanics. 
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Chapter 5 discusses the phenomenon of “induced convergence” 
that can occur in high orders of non-invariant approximations. As 
well as the CK expansion, it discusses a toy model involving the 
alternating factorial series, where the convergence of the optimized 
results can be proved quite simply. We conjecture that OPT in 
quantum field theories will also show such induced convergence, even 
though the R and £ series, in a fixed RS, are factorially divergent. 

In Part II the main work begins. Chapters 6 and 7 set out the 
formal structure of renormalized perturbation series. They explain 
how to parametrize RS dependence and identify the invariants 
pi formed from the coefficients of R and (a). The discussion, 
unfortunately, has to begin with u dependence, which arises at 
the lowest order, but which is, in some ways, the most difficult 
and confusing issue. Thus, it is first necessary to discuss several 
preliminaries in Chapter 6: the Ø function; its integration; the 
appropriate definition of a boundary-condition parameter A; and 
how A depends on RS. Only then can one see that the first of 
the RS variables is not u itself, but the ratio of u to A. A special 
energy-dependent invariant, 9;(Q), is involved here. The way is then 
clear to discuss, in Chapter 7, the other RS dependences that can 
be parametrized by the higher-order -function coefficients cj for 
j = 2,3,.... Here things are in some ways more straightforward. A 
set of invariant quantities p; can be defined as specific combinations 
of the coefficients of R and (a). Only at the end can one see how 
the u/A dependence corresponds to the “j = 1” case of the general 
formulas. 


It is as if a teacher, introducing integral calculus for the first 
time, were compelled to first discuss f dz, necessitating a lengthy 
discussion of the natural logarithm function, ln x, before turning 
to the integrals of positive integer powers and deducing the simple 
formula fda x”! = z” /n. Only then, by considering n = e, could 
the dz result be seen as part of the general pattern — the subtlety 
being that a ~1+elnz as € > 0. 


Chapter 8 addresses the issue of finite orders, where series are 
truncated and exact RS invariance is lost. It explains how to apply 
the Principle of Minimal Sensitivity, from which all later results 
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follow. Chapter 9 shows that the resulting optimization equations 
can be solved to yield the optimized rm coefficients in terms of 
the optimized c; coefficients and the optimized couplant a. It also 
outlines algorithms for numerically solving the remaining equations 
iteratively. Chapter 10 presents in detail some illustrative numerical 
results for the R,+,- ratio in QCD. 

Part III discusses various special topics. Chapter 11 explores 
the infrared limit, where perturbation theory, for an asymptotically 
free theory, is furthest from its comfort zone. Chapter 12 treats the 
important topic of factorized quantities, such as moments of the 
proton structure function, where one factor, the operator matrix 
element, is not perturbatively calculable. Chapter 13 is of more 
specialized interest. It considers QCD in the small-b limit (known as 
the Banks—Zaks or BZ limit) where one can explore some aspects of 
all-orders OPT. Although not addressing the main issue concerning 
the “induced convergence” conjecture, because the fixed-RS series in 
this limit is not divergent, these explorations do shed some light on 
how OPT works at very high orders. 

Some issues, falling outside the main narrative, are discussed in 
appendices to some of the chapters. A few end-of-chapter exercises 
are included; these are mainly intended to enable readers to test and 
reinforce their understanding of the material, but sometimes contain 
results of interest, though not of central importance. References 
have been kept to a bare minimum in the text, though names are 
mentioned where important credit is due. More extensive citations 
of the prior and contemporaneous literature can be found in the 
author’s research papers. These are listed in the bibliography along 
with other signposts to the literature. The bibliography also mentions 
a selection of other applications of the minimal-sensitivity criterion; 
in particular some promising developments in QCD that go beyond 
the scope of this book. 


1.3. Quantum Chromodynamics 


The principal application of OPT is to quantum chromodynamics 
(QCD) so we will very briefly describe the theory. This is only 


Introduction ik 


intended to give readers without much particle-physics background 
a general feeling for the theory — we mainly want to show how the 
gauge coupling g (and hence the couplant a = g?/(477)) enters. 
Readers are advised to allow the technical details to “flow over 
their heads” if necessary, or to consult other textbooks for a fuller 
description. 

QCD is a relativistic quantum field theory. Being relativistic, 
it is natural to use units where the speed of light is unity, and to 
use covariant notation. 4-vectors, such as the space-time position 
x” = (t,x) and energy-momentum p” = (E,p), have a Lorentz 
index u = 0,1,2,3 that is lowered with the Minkowski metric tensor 
Ju = diag(1, —1, —1, —1). An invariant is formed by z“ z, = £” gy, x” 
(with summation over repeated indices always understood). The 
upper or lower index distinction is crucial for Lorentz indices. 

Being a field theory, QCD’s degrees of freedom are fields — which, 
at the classical level, are just functions of space-time position. The 
simplest kind of field is a scalar field ¢(a), and the simplest theory, 
describing free, spinless particles of mass m, has the Lagrangian 
density 


1 1 
L= 57 One = ge. (1.6) 


It can be thought of as the continuum limit of a set of oscillators, 
one at each spatial point, each coupled to its nearest neighbours. 

Being a quantum theory, the fields are upgraded to operators, 
with quantum-oscillator commutation relations. Alternatively, one 
can formulate the quantum theory in terms of a functional integral, 
over all possible classical field configurations, involving the exponen- 
tial of the action, fd*xZ. 

QCD involves spin- particles (quarks) and spin-1 particles 
(gluons). Particles with spin are described by multi-component fields. 
Spatial rotations and Lorentz boosts result in a mixing of the 
components (in a way that we will not attempt to describe here). 
A spin-1 field A“(x) has four real components, labelled by a Lorentz 
index p that can take the values 0,1,2,3. A spin-4 field W(x) has 
four complex components, labelled by a Dirac index a. 
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The Lagrangian density for a free spin-+ fermion is 


L= Da (il ap On = m a8) Yp. (1.7) 


Here 4” are the four Dirac matrices, satisfying an anticommutation 
relation {7“,y’} = 2g"”. Henceforth we will suppress the Dirac 
indices entirely, leaving it understood that a matrix multiplication is 
involved. The shorthand notation @ for y“0, is also convenient, so 
we have 


L=p(id—m)v. (1.8) 


w is the adjoint of w (Hermitian conjugate times the 4° Dirac matrix). 

This Lagrangian is obviously invariant under a change of phase 
from w to ety, since w transforms to ey. Here 0 can be any 
constant. Suppose, however, that we want to have a gauge symmetry 
where 0 can be a function of space-time position x. That can be 
achieved if we introduce a “covariant derivative” 


D! = ð! + ig AH (1.9) 
involving a new vector field A” that transforms as 


ar — A" Long, (1.10) 
g 


In order for the A, field to be dynamical we need to add a kinetic 
term for it. Naively, this would be something like 0, A"0” A,,, akin to 
the kinetic term in Eq. (1.6), but that would not be invariant under 
the gauge transformation (1.10). The appropriate term can be found 
by considering the commutator of two covariant derivative operators: 


[D", D”] = ig(O" A” — 0" A") = igF"” (1.11) 


so that the appropriate gauge-invariant kinetic term is ~} Fp F”. 
If we identify the fields A, and w with the photon and electron, 
respectively, and the gauge coupling with the electric charge e, then 
this is Quantum Electrodynamics (QED): 


Lorp = -Ew E +5 (ip — m) Y (1.12) 
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Unlike the previous Lagrangians, which were quadratic in the fields 
and so were theories of non-interacting particles, the new Lagrangian 
has a cubic term coming from the Pw term, which is 


— gA pyy. (1.13) 


This term, treated as a perturbation to the free-electron and free- 
photon theories, corresponds to a vertex in the Feynman diagrams 
where a photon line meets up with an incoming and an outgoing 
electron line. 

To get QCD we start from a “quark” field, a threefold fermion 
field pa, where a is the “colour” index; a = 1,2,3 (or a =red, green, 
blue). Summing over colours, as in Yaa, produces a “colourless” 
combination. Crucially, all three fields have the same mass so that 
the Lagrangian 


Paid — m)ba (1.14) 
has an exact symmetry under pa — Y4, with 
Y, = User, (1.15) 


where Ua» is any 3 x 3 unitary matrix with unit determinant. This 
symmetry is called SU(3) colour symmetry. (There is also a symmetry 
under an overall et? phase factor, which gives rise to electromagnetic 
interactions of the quark with the photon field.) A continuous group, 
such as SU(3), is characterized by its “Lie algebra,” the set of 
commutators of its generators T A, 


[T4, TP] =ifapcT®. (1.16) 


For SU(3) the index A runs from 1 to 8 (= 3? — 1) and is raised or 
lowered with a simple Kroenecker delta 64g. The group’s structure 
constants, fagc, are totally antisymmetric in the three indices. The 
individual elements are pure numbers. (fi23 = 1, fass = fers = 
V3/2, fiar = fies = fz = fos7 = fsa = fave = $, with the 
others fixed by antisymmetry, or otherwise zero.) The generators 
should be thought of as abstract objects, but there are various 
matrix representations of the Lie algebra. The relevant ones here are 


the fundamental representation, where the T4’s are represented by 
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3 x 3 Hermitian matrices T4, and the adjoint representation, where 


ab? 
they are represented by 8 x 8 matrices Tän whose matrix elements 
are directly given by —ifaBc’s. A general U,, matrix can be written 


as 
Ua» = explið aT) (1.17) 


with eight parameters 04 multiplying the eight TA matrices. 
Again, we wish to make this symmetry into a gauge symmetry 
where the 6,4’s can be functions of x. We need a covariant derivative 


DE = 0" Ligh *Gh (1.18) 


involving an eightfold gauge field G4 (the “gluon” field) that 
transforms as 


1 
Gt, — G4 — fapt GC — 50 9A (1.19) 


The fagc term is the natural one for the transformation of an object 
with an adjoint index A; the additional inhomogeneous term makes 
G% a gauge field. 

Again, we need a kinetic term for this gauge field. To find the 
right gauge invariant form, we again consider the commutator of two 
covariant derivative operators: 


[D“, D] = ig 4 (0"G%, — 0"G4, — gfapcGP4¥GO”). (1.20) 


Note that the commutator here involves two sorts of non-commuting 
objects; derivative operators and generators, with the latter giving 
rise to the gfG? term. The analogue of the F“” tensor in QED is 
thus 


GH’ = OGY, — a°G4, — gfagcG? GO. (1.21) 


This object transforms in the “proper” way for an object with an A 
index — like Eq. (1.19) but without the derivative term. Thus, when 
we form the kinetic term, 


1 
- 555,94", (1.22) 


it is gauge invariant. 
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There are various different “flavours” of quarks, so we need yet 
another index j to label them that is summed from 1 to ny, the total 
number of flavours. In the real world, we know of six flavours called 
u, d,8,c,b,t. Each flavour has a different mass. The Lagrangian for 
QCD is then 

1 a 
Lacp = E a + S Pila — Mjðab) Yo, j. (1.23) 
j=1 
(There are more technicalities about gauge fixing, Fadeev-Popov 
ghosts, and BRST invariance that we do not go into here, though 
they are crucial for proving the theory’s renormalizability.) 

Besides the quark—quark—gluon interaction term of the form 
—g(wytw)G from the J term, there are also three-gluon and four- 
gluon interaction terms g(0,G)G? and g?G*. These give rise to 
corresponding vertices in the Feynman diagrams. 

Feynman-diagram calculations, especially at higher orders, are 
much more complicated when masses are included, so it is usual to 
approximate “real QCD” by massless QCD with an effective number 
of “active” flavours. (Quarks with masses above the relevant energy 
scale Q are regarded as “inactive” and the lighter quarks are treated 
as massless. ) 

Other gauge theories can be constructed in a similar fashion, 
starting with a different symmetry group, such as SU(N) instead of 
SU(3). In all cases, the gauge coupling g is naturally dimensionless 
in 4 dimensions. 
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Part I 


Renormalization and Its Ambiguities 
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Chapter 2 


Dimensional Analysis 
in Quantum Field Theory 


2.1. Physics and Conventions 


A quantitative description of a physical process always involves a 
convolution of two parts: a Ph part — the physics, the phenomenon, 
the “phacts” — and a C part — the conventions, choices of units, 
coordinate system, etc., chosen (by committees) for concreteness, 
convenience, and clear communication: 


Quantitative 
description of a | = Physics | ® | Conventions 


physical process 


All the messiness and arbitrariness lies in the C part, while the 
beauty resides in the Ph part. The actual physics is fundamentally 
independent of the arbitrary choices in the C part. Thus, for instance, 
Articles I and II of the Physicist’s Creed are 


I. Physics is independent of our choice of units. 
II. Physics is independent of our choice of coordinate system. 


These points, though elementary, are nevertheless profound. 
Article I is fundamental to our discussion of renormalization-group 
equations here. Article II, for non-inertial space-time coordinate 
systems, leads towards General Relativity. As physics advances, 
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it is striking that our descriptions tend to involve ever more 
sources of redundancy: for instance, physics is independent of our 
choice of Lorentz frame, our choice of gauge — and our choice of 
renormalization scheme. 

Theoretical work can often be done in mathematical formalisms 
where these invariances are manifest. However, when it comes to 
making concrete, quantitative predictions that can be compared with 
experiment, we are forced to adopt some definite system of units, 
coordinate system, etc. In doing so it is vital that we do not lose 
sight of the fundamental invariance of the physics to those arbitrary 
parameters. 


2.2. Dimensional Analysis 


Let us focus now on Article I: — physics is independent of our choice 
of units. Most quantities in physics are not pure numbers. When we 
describe a length as “3.2 metres” we mean that the ratio of that 
length to another specified length, previously defined to be “1 metre,” 
is the number 3.2. Generally, quantities in physics have dimensions 
and must be expressed in some well-defined units before they can be 
identified with numbers. Traditionally the fundamental dimensions 
are taken to be mass, length, time, with the dimensions of other quan- 
tities being taken as combinations of these. (For instance, angular 
momentum has dimensions (mass) (length)? (time)~!.) Dimensional 
Analysis (DA) is based on the fact that it only makes sense to add, 
subtract, or equate two terms if they have the same dimensions. This 
simple observation — as any good physics student appreciates — 
provides a powerful check against mistakes in calculations. It also 
provides a way to predict, in advance, much about the form that a 
result must take. DA applies to QFT too, of course. In some ways it 
is even simpler, though in other ways it is more subtle. 

DA in QFT is simpler because the fundamental constants A 
(Planck’s constant over 27) and c (the speed of light in vacuum) 
are ubiquitous and provide natural units of angular momentum and 
velocity, respectively. Thus, we may reduce the three fundamental 
dimensions to just one, which may be taken to be mass (or, 
equivalently, energy, or inverse length, etc.) and all quantities can 
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be regarded as having dimensions of mass to some power. We will 
use the phrase “a massive parameter,” to mean “a parameter with 
the dimensions of mass.” 

DA in renormalizable QFT’s is more subtle because of the need 
for renormalization. This subtlety shows up most clearly in massless 
(or, more precisely, classically scale-invariant) theories through the 
counter-intuitive phenomenon of “dimensional transmutation.” The 
bare Lagrangian of a massless renormalizable theory contains no 
parameters with dimensions, only a dimensionless bare coupling 
constant g; nevertheless, the theory ends up being characterized by 
a scale parameter “A” with dimensions of mass. The aim of this 
chapter is to explain how dimensional transmutation is consistent 
with DA. We will show that much, though not all, the content of the 
RG equations presented in QFT textbooks follows simply from DA. 
In the following discussion, the theory is assumed to be a massless, 
renormalizable QFT with a single bare coupling constant, g, with 
massless QCD as a specific example. (Appendix 2.A comments briefly 
on theories with masses.) 


2.3. The Dimensional Transmutation “Paradox” 


For the purposes of discussion, let us picture our theory as a “Black 
Box” that provides the answer to any well-posed, physical question 
that we put to it. This device — alas, rhetorical not actual — is 
employed in order to separate, as far as possible, the theory in itself 
and the way it works, from the way in which we work the theory. 
Later on we will return to the complications that arise because we are 
forced to use approximate methods — indeed, those complications 
are the main subject of this book. For the present, however, let us 
pretend that we have a “Black Box” that can solve the theory exactly. 
The “paradox” of dimensional transmutation can then be described 
in the following terms. 

The Lagrangian we input to the Black Box is scale-invariant — 
it contains no massive constants. Let us ask the theory to give its 
prediction for A(Q), a specific, dimensionless, observable quantity, 
which can depend only on one massive variable Q. For example, 
in QCD we could ask for the dimensionless ratio of cross sections 
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Re+e- (Q) = otot(ete~ — hadrons)/a(ete~ > pt) at a center-of- 
mass energy Q. The answer from the Black Box turns out to have a 
non-trivial dependence on Q. In QCD, Re+e-(Q) tends to a constant 
only as Q — oo; there are definite sub-asymptotic terms that vary 
with Q. 

Finding that the dimensionless A(Q) actually depends on Q is 
quite a shock because we know that the following is true: 


Theorem (Dimensional Analysis). A function f(x,y) which 
depends only on two massive variables x and y, and which is 


(i) dimensionless, 
(ii) uniquely defined, 
(iii) whose definition does not involve any massive constants 


must be a function of the ratio x/y only. 
Corollary. If f(x,y) is independent of y, then it must be a constant. 


The question we have asked of the theory is of the type covered 
by the corollary: the theory has to define a dimensionless function of 
one massive variable, and it has no massive constants available to it. 
Thus, the answer to our question must be a constant — and yet, the 
answer from the Black Box is certainly not a constant! 

The flaw in the “paradox” is the requirement that the function 
must be uniquely defined in the theorem above. If this requirement 
is dropped, then requirement (iii) can easily be circumvented by 
allowing a massive constant to appear, implicitly, as a constant of 
integration. The point is that the Lagrangian we input is not one 
single theory but a one-parameter set of theories with different (bare) 
coupling constants, g. Therefore, our Black Box gives us not just one 
answer, but a one-parameter set of answers — that is, it specifies 
the dimensionless function A(Q) non-uniquely. The paradox is then 
resolved: the DA theorem is correct, but it does not apply, because 
the uniqueness requirement is not satisfied. 

Dimensional Transmutation is the process whereby a theory 
exploits its one-parameter ambiguity so as to evade the dull conse- 
quences of the DA theorem. In order to use this mechanism, a theory 
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must (i) have at least one free parameter, g, and (ii) have infinities 
(or be ill-defined in some other way) so that its predictions cannot be 
expressed directly in terms of g. In field theories, the second require- 
ment is met by the necessity for renormalization. A field-theoretic 
Lagrangian is not a complete description of the theory: in order 
to make sense of it one must use some renormalization procedure. 
The physics is independent of all the details of the renormalization 
prescription, but not of the necessity for renormalization. 

To summarize: we asked the Black Box a question: “What is 
A(Q)?”, naively expecting that the answer would be a constant. 
Having now understood that the theory will give us only a one- 
parameter ambiguous answer, we can reconsider what DA tells us 
about the form that this answer must take. 


2.4. The Ambiguous Answer for A(Q) 


The Black Box is asked to specify A(Q) and — while it is allowed 
a one-parameter ambiguity — there are no massive constants it 
can use. So, in order to produce a non-constant A(Q), the theory’s 
definition must be somewhat indirect. One can immediately think of 
two possibilities: the black box could specify only the first derivative 
of A(Q), or it could define A(Q) recursively. These two alternatives 
are actually equivalent, as we shall show explicitly. 
If the theory specifies the first derivative of A(Q), then this 
specification must take the form 
aia (2.1) 
dQ Q 
where B(A) is a dimensionless function of a dimensionless argument 
and must be uniquely defined by the theory. The important point 
is that the form of Eq. (2.1) is unique: the explicit Q-dependence is 
fixed by DA. 
If instead the specification of A(Q) is recursive, then it must have 
the form 


AQ) =F (LAW) for any p (2.2) 
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where F is a uniquely defined, dimensionless function which must 
be independent of the arbitrary massive parameter u, because no 
massive constants are allowed in the specification of A(Q). Again, the 
form of the equation is dictated by DA. The one-parameter ambiguity 
arises because the theory does not provide a boundary condition for 
Eq. (2.1) or Eq. (2.2). This boundary condition must be fixed by an 
appeal to experiment. 
Let us first analyze Eq. (2.1). Integrating the equation gives 
A dA 


ln Q + constant = |. BA) = K(A). (2.3) 


For present purposes a lower limit of oo is simple and convenient. 
However, that choice would not be appropriate in perturbation 
theory and a different definition will be discussed later in Sec. 6.3. 


The constant of integration can be written in infinitely many 
ways, corresponding to different choices of mass units, by writing 
“constant = Ko—ln u,” where u is some arbitrary massive parameter. 
By considering the point Q = p one sees that Ko = K(A()), so that 


In(Q/p) + K(A(u)) = K(A(Q)). (2.4) 


What has really happened is that a massive constant has crept in as 
an integration constant. One way of characterizing this constant, A, 
is as the value of u for which A (A(yw)) vanishes (assuming that such 
a point exists). Thus Eq. (2.1) is equivalent to 


A(Q) = K (K(A(u) +1n(Q/p))) for any p, (2.5) 


or to 
A(Q) = K7!(In(Q/A)) for some A. (2.6) 


Equation (2.5) holds for arbitrary u, and OA/Opu|g = 0 for all Q, 
as the reader should verify. Equation (2.6), however, holds only 
for some fixed, but unspecified constant, A. We could have added 
“where A is defined by K(A(A)) = 0,” but this “definition” is mere 
tautology, since this information is already contained in Eq. (2.6). 
The unspecified nature of A reflects the one-parameter ambiguity of 
the theory. 
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Next let us investigate Eq. (2.2) to show that it also leads to 
Eqs. (2.5), (2.6). Changing the variables Q, u to 


z=Q/p, A=Alu), (2.7) 
the condition that F is independent of ju is 
dF OF dz OF dA(u) 
— = — — + — (z, A —— =0. 2. 
po e | we + Ee, | =o es) 
Since use = —z, this becomes 
OF OF dA( 1) 
—(z,A —(z,A)| = 2.9 
Tea] / Be Ji pt (2.9) 


The right-hand side of this equation is independent of z and is a 
function of A only. In fact, it is precisely the function B(A) of 
Eq. (2.1). Equating the left-hand side of Eq. (2.9) to B(A) leads 
immediately to 


OF OF 
A(nz)  O(K(A))’ (2.10) 
where 
A 1 
K(4) = i. ay (2.11) 


as before. The solution to Eq. (2.10) is of course that F can depend 
only on the single variable K(A) + Inz. Hence, reverting to Q, u as 
the variables, 


F(Q/u, A(u)) = f (K(Au)) + n(Q/p)) , (2.12) 


where f is a function of a single variable. The left-hand side is, by 
definition, A(Q), so by considering u = Q one sees that A(Q) = 
f(K(A(Q))), so f must be K-t. Thus, one recovers Eq. (2.5) 
as promised. Equations (2.1) and (2.2) are therefore equivalent 
definitions of A(Q). In fact, any answer the Black Box could give 
would have to be equivalent to the form (2.1). 

Notice the curious feature that the discussion often involves 
an arbitrary massive parameter, u. Its role is that of a conjurer’s 


22 Renormalized Perturbation Theory and Its Optimization 


handkerchief — “Now you see it, now you don’t!” It is necessary in 
order to write down certain formulas, but physically it is never really 
there. 

The massive constant A is quite different. Its value does have 
physical significance. The theory does not, however, predict its value. 
In fact, since A is not in its input Lagrangian, the Black Box cannot 
ever mention A. Nor does it need to: it simply leaves a boundary 
condition unspecified. We ourselves introduce A as a convenient 
means of parametrizing the theory. By expressing the theory’s 
predictions in terms of A and then fitting these to N experimental 
measurements, we can extract the value of A and make (N — 1) tests 
of the theory. 

To summarize: There is essentially only one form in which the 
Black Box can give the answer to our question “What is A(Q)?” 
It must provide a fully specified function B(A), where B and its 
argument are dimensionless, together with the statement that A(Q) 
is related to B(A) by Eq. (2.1). This is equivalent to Eqs. (2.2), 
(2.5), (2.6) as shown above. The Black Box cannot ever mention 
a massive constant in its answers. Nevertheless, we can see that 
a dimensional constant does creep into the theory because of the 
absence of a boundary condition. 


2.5. The Form of Other Physical Quantities 


Now that we understand the form of its answer for one specific 
physical quantity, A(Q), let us ask the Black Box to predict some 
other physical quantity. The answer cannot be an independent repeat 
of the previous story, since this would introduce a second “A” 
parameter, and there is only a one-parameter ambiguity in the 
theory. Instead, having already given us A(Q), the Black Box can 
now give all its other results in terms of A. 

Consider some observable quantity o(q1,-.-,@n) with dimensions 
of (mass)?, which depends on n massive variables q1,...,@n. Let us 
pick one of these variables — say qı, though it could be any qi or 
any combination with the dimensions of mass — and rename it “Q,” 
and then rewrite the others as dimensionless ratios 02 = q2/Q, etc. 
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If the usual (unique) DA applied, 7 would have to take the form 
o(Q,02,...,0n) = QPS (02,..., 0n), (2.13) 


where the function S' is dimensionless and depends only on the (n—1) 
ratios. This form will no longer hold because of the one-parameter 
ambiguity of the theory, now embodied in A(j). Reapplying DA with 
this in mind the correct form must be 


o(Q, 02,..-,9n) = QPS @ -n 2A) , (2.14) 


where S must be independent of u, the arbitrary massive parameter 
which serves as the argument of A. 

Let us analyze the form of Eq. (2.14). Firstly, since y is arbitrary, 
we are free to pick u = Q if we please. Hence, the dependence of S 
on Q/ and A(n) is equivalent to dependence on the single variable 
A(Q). 


One can also show this in a more laborious way. Equation (2.14) 
is similar in many respects to Eq. (2.2), but without the latter’s 
recursive nature, so one can repeat the analysis in Eqs. (2.7)- 
(2.12) to show that S depends on Q/p and A(u) only through 
the combination K(A(u)) + In(Q/). This variable, by Eq. (2.5), 
is just K(A(Q)). 


Another way of expressing the p-independence of Eq. (2.14) is 
the following. Define a function S of (n + 1) independent variables 


5 = 5 (0,-0.0 2A), (2.15) 


which reduces to S when the extra variable A is set equal to A(). 
S becomes independent of u only when A = A(j). Translated into 
mathematics, that statement becomes 


ws] _ (08, a8 aa 
= \ou OA du 


= 0. 2.16 
a (2.16) 
Multiplying by u and remembering that A(u) is specified by 


at A=A(p) 


at A=A() 


yp = BA), (2.17) 
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Eq. (2.16) becomes 


(uo + Bu) Š 


=0. 2.1 
Ou OA ee) 


at A=A(1) 


The reader will probably recognize this as a renormalization group 
(RG) equation of some kind. Indeed, the equation can be applied to 
g itself: 


Gar + Bawi) o=0. (2.19) 


Here the notation is perhaps a bit sloppy and may need a little 
explanation. In Eq. (2.18), it was evident that u and A were 
independent variables, so that each partial derivative is taken with 
the other variable held constant. Then, afterwards, one is to replace 
A with A(u). In Eq. (2.19) we dispense introducing A, so we have 
added “...|4”as a reminder that the u partial derivative is to be 
taken pretending that A is a constant, despite the fact that A is a 
function of u. 

There is nothing magic or mysterious about this renormalization- 
group (RG) equation. Solving this partial differential equation only 
leads back to Eq. (2.14), together with the verbal statement that 
o is independent of u. The form (2.14) followed simply from 
Dimensional Analysis, the only subtlety being the theory’s one- 
parameter ambiguity. 

That subtlety requires a distinction between canonical dimen- 
sions and scale dimensions. The canonical dimension of o is just 
D, and it counts both the Q and p-dependences since both are 
massive parameters. The scale dimension, however, only counts the 
dependence of o on the physical scale Q: 


Dio) = e (2.20) 


Since S = ø /QP is a dimensionless function, we know from DA that 
it can depend on Q only through the ratio Q/j, as in Eq. (2.14), so 
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that 
Q ds uds 
SE 2.21 
S dQ S duja ( ) 
Note that we must hold A(x) fixed here. Hence, D(¢) can be expressed 


as D — oe F Using the RG equation (2.19), we then find 


zan (2.22) 


2.6. Summary 


We began by asking our theory, considered as a Black Box, about 
a specific, dimensionless physical quantity A(Q). Naïve DA, which 
would force the answer to be a constant, does not apply because the 
theory’s answer is not unique. Rather, “one-parameter ambiguous 
DA” applies, requiring the answer to be of the form of Eq. (2.1): 


Q= = B(A), (2.23) 


where the function B(A) is a dimensionless function of a dimen- 
sionless argument, and is fully specified by the theory. The one- 
parameter ambiguity arises from the missing boundary condition — 
which allows a massive constant A to creep in. When asked about 
other physical quantities, the Black Box specifies them in terms of 
the first quantity A(n) at some arbitrary mass scale u. The fact that 
the new physical quantity, ø, is independent of the arbitrary mass 
scale u is expressed by Eq. (2.19): 


C + BA) r=. (2.24) 


There is an obvious artificial asymmetry in the preceding descrip- 
tion of the theory. One particular physical quantity A is singled out 
as “special” and all others are expressed in terms of it. This is an 
arbitrary choice: we could choose any physical quantity to play the 
role of the “special” one. If we had a Black Box that solved the theory 
exactly, there would be no problem here: whatever description of the 
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theory we adopted, the results would be entirely equivalent. In fact, 
the special quantity does not need to be a physical quantity. It could 
be defined in theoretical, rather than experimental terms. A natural 
choice, especially in the context of perturbation theory, is to use 
some kind of “renormalized coupling constant.” If the theory could 
be solved exactly, it would not matter how we chose to define it. 
At finite orders of perturbation theory, however, there is a problem. 
Before we return to that issue — the main topic of this book — let us 
digress to consider unphysical quantities, such as Green’s functions.® 


2.7. Unphysical Quantities and “Anomalous 
Dimensions” 


Many of the quantities we deal with in field theories are not, 
even in principle, directly measurable quantities. These useful, but 
unobservable, quantities, such as Green’s functions or proper vertices, 
often suffer from bad diseases. For instance, they can be infrared 
divergent, and they can depend on unphysical parameters. In gauge 
theories they can depend on the gauge-fixing parameter. (That is 
an additional complication, ignored here; it is discussed in Appendix 
2.B.) The relevant point here is that such unphysical quantities can 
depend on ju. 

Of course, the advantage of having the Black Box is that we do 
not need to trouble ourselves with unphysical quantities. Such quan- 
tities are only needed as intermediate steps in calculating physical 
quantities — which our Black Box now enables us to obtain directly, 
with no labour. However, out of curiosity, we might like to ask the 
Black Box to calculate an unphysical quantity for us. The answer, 
though, will depend, not only on the variables we specify, but also on 
unphysical variables, including u. The “meaning” of this variable is 
buried in the Manufacturer’s Specification for the Black Box which 
describes the technical details of the renormalization prescription, 
gauge choice, etc., that the machine uses in its calculations. We need 
not be concerned with these details here. 


“The next section is useful background for Part III, but is not needed for Part II. 
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To begin with, let us consider a hypothetical example in which 
a specific unobservable quantity ['(q@,...,@n;4), with dimensions 
of (mass)? , happens to depend on y in a particularly simple way, 
namely 


P (qi; u) = u” Glai; u, A(t), (2.25) 


where the exponent y is a numerical constant and G is independent 
of u. 

The statement that T has the form of Eq. (2.25) can be translated 
into a RG equation. As in the previous section let us regard I as the 
special case of Î(q;; u, A) when A = A(u), so that, as an identity, 


dI or dA OV 
a = || —— 2.26 
du Ou du OA ey 
A H lat A=A(u) 
But, from Eq. (2.25), 
dr 
—=VI. 2.2 
Bu (2.27) 


Substituting this in Eq. (2.26), and invoking Eq. (2.17), gives 


=0, (2.28) 


o o ~ 
pe + BIA) — 7 i 
at A=A() 


which is recognizable as a typical RG equation, containing an 
“anomalous dimension” term, y. 

Again, there is no particular magic in having a partial differential 
equation. The content of this equation is much better expressed by 
its solution, which is Eq. (2.25) together with the verbal statement 
that G is independent of u. However, the virtue of Eq. (2.28) is that 
this form generalizes to apply to I’s which depend arbitrarily on p. 
For this general case y must now be regarded as a function defined 
by Eq. (2.27). 

From DA alone there is no reason why y cannot depend on 
qı %2 qn 


agog S well as A(u). If this happened in practice, then 


the RG equation would hardly be very useful. Fortunately, in field 
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theory the y’s come from the renormalization constants (the Z’s), so 
they have no dependence on any of the physical variables q1,..., dn, 
and hence, by DA, they can only depend on yw through A(u); 
thus, y = 7(A(u)). This happy circumstance is not altogether 
surprising: the various unmeasurable quantities must be able to 
combine together to produce observable quantities in such a way 
that their dependence mutually cancels. In consequence, the u- 
dependence of each quantity has to be relatively simple; e.g., in 
a Green’s function the -dependence must factor into separate p- 
dependences for each external line. 


Equation (2.28) is derived field-theoretically by considering the 
renormalization of the quantity I. The essential ingredients of the 
derivation are (i) the s-independence of the bare quantity, (ii) 
the cutoff independence of the renormalized quantity, (iii) their 
“proportionality” through some infinite renormalization constant 
(Tren = ZI pare) and (iv) Dimensional Analysis. 


The solution of Eq. (2.28) when y is a function of u, but not of 
G15-++54ns is 


Tagine) = Ča; p, Aw) = uP Glai; u, ACH), (2.29) 
where G is independent of u, and the notation u°]! means 
H du! 
pl = exp f EAW). (2.30) 
It is easily verified that the -dependence of T satisfies Eq. (2.27), the 


defining equation for y. Using Eq. (2.23), but with p as the argument 
of A, we may express ul as 


A(n) 
ull = exp | dz ae) (2.31) 


where x is a dummy “A” argument. 
The scale dimension of I, 


Qd 


Dimensional Analysis in Quantum Field Theory 29 


is (unlike y) a physical quantity. That is because the wavefunction- 
renormalization constant Z, multiplicatively renormalizing T, will 
cancel out since it is independent of the momentum arguments qi, 
and hence Q independent. Generalizing the discussion leading to 
Eq. (2.22) one finds 

Dery = D(A) + OE (2.33) 
where A = A(Q) here. The —7(A) term arises from the uhl factor 
inr. 

To summarize: the non-trivial content of the RG equation, 
Eq. (2.28), is that the anomalous dimension y depends on p only, 
and only via A(j). This implies that the ji-dependence of the 
unphysical quantity T (qi; u) factorizes and is not intertwined with 
the q;-dependence. 


2.8. Illustration: QCD in Leading Order 


In this section, we consider leading order in perturbation theory. 
For the present pedagogical purposes, we will ignore the fact that 
leading order in perturbation theory is not an exact solution of the 
theory. (Alternatively, we can view it as effectively exact if we limit 
our ambition to predicting only the leading asymptotic behaviour in 
the Q — oo limit.) Our aim here is to make contact with the usual 
textbook treatments of some classic QCD results. 

As the reader may well have already observed, the formulas in 
the preceding sections look much more familiar if A(Q) is identified 
with a,(Q), the QCD running coupling constant at an energy scale 
Q. This identification will indeed be made in this section. It does not 
matter here that a,(Q) is not a proper observable, in that it is not 
defined in experimental terms. Its precise definition will not matter 
for the current discussion. 

With A(Q) = a(Q) (we omit the s subscript henceforth) Eq. (2.1) 
is immediately recognizable as the relation between the coupling 
constant and the “8 function” 


—(Q) = B(a(Q)). (2.34) 
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In the literature there are several variants of this definition of £: 
sometimes it is defined in terms of g(Q), where a = g?/(47), and 


sometimes the variable is Q? rather than Q. We use the above form 
in this section because it is perhaps the most common in textbooks. 
Later, however, we will adopt a slightly different definition of @ in 
terms of a= a/r. 


The importance of the 2 function is that it is the theory’s oblique 
way of specifying a(Q). Notice that the sign of 8 governs whether 
a(Q) increases or decreases with Q. In QCD, the 8 function has a 
perturbation expansion which begins 


Bla) = boa? +---, (2.35) 


where { is a positive number, calculable from Feynman diagrams. 
For a theory with ny flavours of (massless) quarks, 6o = (33 — 
2n-)/(67). The K function of Eqs. (2.3), (2.11) is then, to leading 
order, 


= a da’ 7 l 
K(a)= L Rae Baa (2.36) 
Equations (2.5), (2.6) then yield the familiar results 
_ a(u) 
a(Q) = T+ balu) n(Q/n) for any H, (2.37) 
a(Q)= WOT for some A. (2.38) 


As is clear from the earlier discussion, Eq. (2.38) is equivalent to 
Eq. (2.37) and not some kind of approximation to it. The decrease 
of a(Q) as Q — co, due to (a) being negative, is the justly famous 
“asymptotic freedom” property. 

With A(Q) and B(A) replaced by a(Q) and (a), respectively, 
Eq. (2.19) becomes precisely the conventional RG equation for a 
physical quantity; 

Ge i + aa) ) c=. (2.39) 


As remarked before, this equation means that the physical quantity 


g is actually independent of u; the explicit u dependence cancels with 
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the implicit u dependence via a(n). From one-parameter-ambiguous 
DA we could have written down immediately the solution of this 
partial differential equation as Eq. (2.14). 

For example, consider the physical quantity Re+e-(Q) = 
Otot(ete — hadrons)/o(ete~ — pty) at total center-of-mass 
energy Q. According to one-parameter-ambiguous DA, this dimen- 
sionless function of one massive variable Q must have the form 


Reve (Q) = f (alu) for any p (2.40) 


R.+e-(Q) has a sensible free-field-theory limit, namely the parton- 
model result (3 X; q?), where q; is the electric charge of the ith quark. 
(Note that the parton-model result is a Q-independent constant 
because ordinary DA then applies.) If we write 


Rael a= (93 > q ar (+R), (2.41) 


then R represents the QCD correction. It must be a function only of 
-= and a(j), and has a perturbation expansion: 


R= ro (=) (1 +rı(2) (=) +- =) (2.42) 


with coefficients that are calculable from Feynman diagrams (rọ 
turns out to be 1). The perturbative coefficients, r1, ..., depend on 


the ratio 2. This dependence, as we shall see in the next chapter, 


is through In@ terms, so that if Q becomes very large relative 
to u, the perturbative coefficients become horribly large. Thus, in 
practice — not having an exact result, but only the first few terms of 
a perturbation series — our choice of the arbitrary scale u actually 
matters. Rather than fixing  once-and-for-all, we should allow it to 
“run” with Q, so that R,+,-(Q) depends only on a(Q): 


n= (22) (rina (A) ay 
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This is known as “renormalization-group-improved” perturbation 
theory. In an asymptotically free theory the “running coupling 
constant” a(Q) tends to zero as Q tends to infinity, so that RG- 
improved perturbation theory is good at high energies. Thus, we 
can definitely say that the leading asymptotic behaviour is R ~ 
a(Q)/m ~ 1/(m% In Q). 

However, the RG/DA argument only sets the scale for the 
running coupling constant. One can equally well show, by putting 
u = nQ in Eq. (2.40), that Re+e-(Q) is a function only of a(nQ), 
where n could be any constant (or, indeed, any function of a(js)). If 
we had exact results, or if we only asked about Q —> o, the choice 
of n would not matter at all. But in practice, since we want to get 
quantitative results from a truncated perturbation series, the choice 
of n does matter. This choice is entangled with the other RS choices 
involved in the definition of a, and is just one aspect of the wider 
RS-dependence problem. 


At leading order, when the correction coefficient rı is unknown, 
one can only guess at a “reasonable” n: Leading order is thus only 
a qualitative approximation. While good phenomenology can be 
done — and was done — with just leading-order results, quantita- 
tive results require at least next-to-leading order calculations. 


Appendix 2.A: Theories with Masses 


Our discussion in this chapter was specific to massless theories. It 
might at first seem that, if the Lagrangian contains a mass term, 
then all the previous arguments are undermined. However, this is 
not true since the bare mass, being ill-defined (infinite or infinitesimal 
depending on regularization), cannot directly provide a finite mass 
scale, any more than the bare coupling constant could. 

In fact, the previous picture still holds, except that the theory 
now has a two-parameter ambiguity. Generalizing the previous 
procedure, one can isolate these ambiguities in two renormalized 
quantities. These are most naturally chosen to be a coupling constant 
a(u) and a “running mass” m(u). Each of these functions is specified 
via a first-order differential equation, whose boundary condition must 
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be determined by experiment — through measurements of A and the 
physical mass Mphys, respectively. 

Other quantities must then be expressed in terms of a(n), 
m(n), and u, leading to generalizations of the RG equations (2.19) 
and (2.28) containing an extra —7,,m(0/Om) term, where Ym is 
somewhat analogous to the 6 function: wdm/du = —y%mm(p). 
This leads naturally to Weinberg’s form of the RG equation. A 
proper field-theoretic derivation of this equation gives the vital extra 
information that both ym and the 8 function depend only on a(y). 

To derive the Weinberg RG equation field-theoretically, one 
works in a “mass-independent” renormalization scheme (e.g., the MS 
scheme), which does not involve the physical mass in its definition. 
Alternatively, one may fix the mass to be the physical mass and 
use the original formulation due to Callan and Symanzik, but this 
involves the specifically field-theoretic concept of mass insertions in 
Green’s functions. 

Although the focus in this book is mostly on massless theories, 
the optimization procedure of Part II can be applied directly to a 
massive theory if the mass renormalization uses the physical mass. 
However, there might be further advantages in generalizing the 
procedure so as to optimize both the running coupling and the 
running mass. 


Appendix 2.B: Gauge Dependence in Gauge Theories 


Here we return to the problem of the gauge dependence of unphysical 
quantities which was avoided in Sec. 2.7. This can be treated 
by analogy with Appendix 2.A’s discussion of mass terms. To 
quantize a gauge theory one must add a gauge-fixing term to the 
Lagrangian. This step effectively introduces another sort of bare 
coupling constant — the bare gauge parameter € — which affects 
only unphysical quantities. The bare gauge parameter must itself be 
renormalized, leading to a renormalized gauge parameter €(j1). The 
RG equation for unphysical quantities will contain, in addition to an 
anomalous dimension, a term 6(a,&)(0/0€), where ô is the analogue 
of the £ function: WOE /Op = (a, E), with the partial derivative taken 
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at constant a. Observable quantities, being gauge invariant, do not 
depend on (u), so their RG equations do not need this term. 

Some RS’s explicitly involve the gauge choice in their definition. 
In such schemes, there are two ways in which one might define the 
8 function, depending on whether the derivative in wOa/Op is taken 
holding constant the bare or the renormalized gauge parameter. It 
is important for our later analysis, which will use only renormalized 
quantities, that the latter definition is used. 


Chapter 3 


Renormalization as Reparametrization 


3.1. Introduction 


This chapter aims to describe renormalization conceptually, with 
technicalities kept to a minimum. It is intended to complement the 
presentations in QFT textbooks, which should be consulted for more 
concrete, technical details. 

Consider a physical quantity in QFT, such as a scattering cross 
section o, that can be calculated in perturbation theory. It will be 
a function, not only of various kinematic variables (the centre-of- 
mass energy, the scattering angle, etc.), but also of the parameters 
in the theory’s Lagrangian, such as the particle masses and coupling 
constants. For simplicity, let us consider a theory with just one mass 
parameter m and one coupling constant g. 

The physical quantity ø can be calculated in perturbation theory 
from the relevant Feynman diagrams. The leading order result 
comes from one or more “tree” diagrams, and is seemingly a well- 
defined, finite result, co. However, the higher-order terms come from 
diagrams involving loops. These loops give rise to integrations over 
the loop’s 4-momentum and these integrals often fail to converge 
in the ultraviolet. Thus, at first sight, the result that emerges from 
perturbation theory seems meaningless: 


o = oo + 97 (oo?!?) +--+. (3.1) 


This is an open access book chapter published by World Scientific Publish- 
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Historically, in quantum electrodynamics (QED), the leading term oo 
was found to agree quite nicely with experiment, but when theorists 
calculated the “correction term” and found that it was infinite there 
was consternation. 

The problem lies entirely in the fact that we are trying to 
parametrize o in terms of the mass and coupling-constant parameters 
present in the Lagrangian — and these are not the mass and coupling 
strength that one could measure in any actual experiment. We will 
henceforth add a “B” subscript to stress that these are “bare” 
quantities. They are not measurable quantities at all, and therefore 
need not be finite. There is no need to be frightened of this. It is 
one of the great lessons of modern physics that only the physically 
observable quantities of a theory need be real and finite (and 
agree with experiment); successful theories may well involve other, 
unmeasurable objects that are complex, or infinite, infinitesimal, 
ambiguous, or otherwise bizarre. For example, quantum mechanics — 
unlike classical mechanics — intrinsically involves complez-valued 
amplitudes, but all its predictions for observable quantities are real. 
In quantum field theory, there turn out to be various theoretical 
objects that are not even finite, but that need not bother us provided 
all the physical predictions are finite, real, and sensible (and agree 
with experiment). 

The bare mass mg is the mass that the particle would have in 
the unperturbed theory — that is, in the free-field theory obtained 
by setting gg = 0. In the full, interacting theory, however, the 
propagator is given by a sum over all possible two-legged Feynman 
diagrams — representing the particle interacting with itself and with 
fluctuations of the vacuum (see Fig. 3.1). These interactions shift the 
position of the propagator’s pole, and hence change the particle’s 
mass. The particle’s physical mass Mphys — the mass measurable in 
an experiment — corresponds to the position of the pole in this full 
propagator. 

The “bare” coupling constant gp is the coupling constant for a 
“free” particle, or rather, a particle that is allowed to interact once, 
and once only. That is a somewhat absurd and unphysical concept, 
because as soon as we “turn on” the perturbation the particle will 
interact, with itself and with vacuum fluctuations, arbitrarily many 
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Fig. 3.1. The full propagator is a geometric sum of self-energy insertions, where 
the self-energy is the sum of all 1-particle irreducible diagrams — ones that cannot 
be separated into two pieces by cutting just a single line. (The external legs of 
the latter diagrams are “amputated”; they have no propagator factors associated 
with them.) For illustrative purposes we show diagrams in a QED-like theory. 


-o< €D- 


Fig. 3.2. The effective coupling strength will be affected by interactions that 
“dress” the bare vertex. A few illustrative examples are shown. 


times. Thus, any attempt to measure an “effective coupling strength” 
gest (however one might choose to define it, experimentally) will be 
measuring something that involves a sum of infinitely many Feynman 
diagrams (see Fig. 3.2). 

For the theory to be viable, the measurable quantities mpnys and 
geg need to be finite. But there is no reason to demand that mg and 
gp be finite — because there is no way, even in principle, to measure 
them. The problem of infinities in the calculation of a physical 
quantity o is simply an artefact of the result being parametrized 
in terms of mg and gp, which are not themselves finite quantities. 
All we need to do is to reparametrize the result for ø in terms of 
parameters that are finite, such as Mphys and gep. In fact, it is not 
necessary to use parameters that are truly measurable quantities; it 
is enough that we use suitable “renormalized” parameters mp, gR 
that are defined so that they are related in a finite and definite way 
to the physically measurable mphys and gefr- 


RENORMALIZATION = REPARAMETRIZATION 


o1(mB, gB) o1(mR, gr) 


mr = fm(™B, gB) 
Se ` — Experiment 


gR = fa(mB: gB) | z 


Theory 
L(mB, gB) 


on(™B, 9B) N(MR, gR) 
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Predictions for N physical quantities o,,...,0n are calculated 
from the theory. Initially, when parametrized by mp,gp, they 
contain divergences. However, once reparametrized in terms of 
suitably defined “renormalized” parameters mpr,gr, they become 
finite functions of finite parameters. They can then be fitted to 
experimental data, providing N — 2 tests of the theory, together 
with “best-fit” values for the Mmr, gr parameters. 


If we had a Black Box that could solve the theory exactly, 
then this situation would be evident. The physical predictions of 
the theory are, in fact, finite, but to make this fact manifest one 
must make a substitution, eliminating mg, gp in favour of mp, gr. 
It is just a substitution and nothing is being thrown away or “swept 
under the rug.” The term “renormalization” is really a misnomer, 
and “reparametrization” would be a better name. 


The above statement should be qualified by saying that it 
applies to the physical quantities of the theory. Green’s functions 
are not made finite by reparametrization alone; they also need to 
be rescaled by a suitably infinite overall factor. That is, they need 
an actual change of normalization, a true renormalization. Each 
leg of an n-point Green’s function needs a suitably infinite factor 
a =, where Zw is the so-called “wavefunction renormalization” 


constant, so that the whole n-point function is rescaled by a j 
(See Appendix 3.A.) 


If we could solve the theory exactly, it would not matter exactly 
how we define the mR, gr parameters. Any definition that does the 
job of manifesting the finiteness of the theory’s predictions would be 
acceptable. Changing from one definition to another would change 
the appearance of our results, but not their content. However, when 
we use perturbation theory, our gr parameter plays a double role: It 
is not only one of the free parameters of the theory (taking over that 
role from gz) it is also the expansion parameter of our approximation 
procedure. Because of this second role, the choice of definition of 
gr does matter in practice. We return to this central concern in a 
moment. 
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3.2. Regularization 


We have talked quite cavalierly about “divergences” and “infinities,” 
but in order to be doing proper mathematics we need a procedure 
known as “regularization.” This step involves many technicalities — 
which often dominate textbook accounts of renormalization and tend 
to obscure what is really going on. 

A “regularization” is some systematic, consistent, mathematical 
modification of the theory that renders the divergent integrals conver- 
gent. One example is some kind of “ultraviolet cutoff” procedure that 
puts an upper limit on the magnitude of the momentum allowed in 
the loop integrals. We will use “Mav” to denote the ultraviolet cutoff 
parameter. Other regularization methods modify the free propagator, 
using a large parameter Mav that acts much like a cutoff. Another 
type of regularization is dimensional regularization, which formulates 
the theory in d space-time dimensions, where d is allowed to be non- 
integer and is taken to be 4— €, with e — 0. We will return to discuss 
dimensional regularization later. 

Regularization is like scaffolding that holds up a rickety building 
while its structure is being strengthened. It replaces meaningless 
divergent integrals with well-defined mathematical expressions that 
are finite, but which would become arbitrarily large in the limit 
My — oo. After one has made the reparametrization step — 
eliminating mz, gg in favour of mr, gr — one finds that cancellations 
occur, so that the limit Mav — oo can then be taken, yielding 
manifestly finite results. 

It is not necessary that the regularized theory be a physically 
acceptable theory. Indeed, the regularized theory is often very ugly 
and may violate some symmetries of the original theory. One has to 
take care that important symmetries, such as Lorentz invariance and 
gauge invariance, are indeed recovered when the My, — oo limit is 
taken. 


However, there are some cases of “quantum anomalies,” where 
a symmetry of the bare Lagrangian — which would certainly be 
a symmetry of the corresponding classical field theory — cannot 
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be maintained in the regularized, quantum theory even in the 
My, — oo limit. Dimensional transmutation, whereby a scale- 
invariant Lagrangian gives rise to a theory with a characteristic 
scale A, is one example of a quantum anomaly. Chiral symmetry is 
another case where anomalies arise. 


It is very important to be clear that regularization, though 
a technical necessity, is not physics — just as scaffolding is not 
architecture. In the end the regularization is removed, revealing the 
theory’s finite predictions — just as scaffolding is eventually taken 
down to reveal the newly renovated building. 


3.3. Defining a Renormalized Coupling Constant 


There are many, many ways of going about defining a renormalized 
coupling constant gr. One traditional approach, known generically 
as “momentum subtraction” (or MOM) schemes, starts from the 
observation that, since the bare coupling constant gg corresponds 
to the bare 3-particle vertex, it is natural to define gr in terms 
of a “dressed” 3-particle vertex, involving a sum over all possible 
Feynman diagrams with three (off-shell) legs. Conceptually, this 
idea is helpful — but attempting to develop it into a specific 
definition of gr leads one into a deep quagmire of technicalities. 
For example, in QCD, the bare coupling gg is associated with 


various vertices quark—quark—gluon, 3-gluon, and ghost—ghost— 
gluon vertices, as well as the 4-gluon vertex, proportional to g3. 
Which of these vertices, or combination thereof, should we choose? 
Then there is the question of the momentum carried by each leg: 
Unlike the bare vertex, the dressed vertex depends non-trivially on 
the momentum configuration. There are many other technicalities, 
such as the spin-index decomposition, etc. As an example, a typical 
MOM scheme might define gr as “the coefficient of the Dirac matrix 
y" in the decomposition ... of the quark-quark-gluon vertex at the 
“symmetric point” where the 4-momenta pf, ph, p§ satisfy 


pi = på = pe =—3p", pi-P2 = p2-P3 = p3-Pi = iM’, (3.2) 


in the Feynman gauge with ....” (We have suppressed some of 
the more horrific technical details with “...”.) Note that the 
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renormalization conditions can be specified without reference to any 
particular regularization procedure. 

The most important aspect of this, or any such definition, is that 
gr will depend on a scale u, known as the “renormalization point” 
or “renormalization scale,” that sets the overall energy-scale of the 
dressed vertex. That is 


gR = gR(p, other choices). (3.3) 


We will later use the terminology these other choices are the 
“renormalization prescription” (RP). When we say that gr depends 
on renormalization scheme (RS), we mean that it depends on p 
and the “other choices” made in defining gr. Indeed, for physical 
quantities, “specifying a RS” means the same thing as “defining a 
renormalized coupling constant, gr.” 


One should beware of claims to have defined an “RS-invariant 
renormalized coupling constant.” That is simply a contradiction in 
terms. The definition of a coupling constant is a renormalization 
scheme, and vice versa. We return to this point at the end of the 
chapter. 


Let us next consider calculating gr in perturbation theory, 
assuming that we have adopted some specific RP. The first term is 
just gg, but there are then corrections involving Feynman diagrams 
with two more vertices (and hence two more powers of gg) and all 
contain one loop. These loops result in integrals that are typically 
divergent and so will need regularization. That is, we will find a result 
of the form 


gr = 9B(1+ gpl (mu) +), (3.4) 


where [ (u) is some divergent integral, or sum of divergent integrals, 
rendered large but finite by the regularization. I(:) will depend on 
the specific choices in the gr definition adopted; in particular it will 
depend on the scale u. It is convenient to work with the square of g 
and absorb some 7 factors by defining 


2 
g 
(3.5) 
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for both B and R cases. Squaring Eq. (3.4) then leads to 
in = ae e E (3.6) 


where I(j:) = 87?I (u). Inverting this equation (“reversion of a power 
series”) gives 


ag =ap(l—aprl(y)+---). (3.7) 


3.4. Renormalizing a Physical Quantity 


Now consider the calculation of some physical quantity, such as 
a two-particle scattering cross section o. It will depend on some 
kinematic variables, which we can take to be the total centre-of- 
mass energy Q and scattering angle 0. At lowest order the result is 
given by a tree diagram, but then there are corrections from one-loop 
diagrams, which again will have loop integrations that in general do 
not converge, in the absence of regularization. Thus, we will find a 
result of the form 


o = (factor) Gal (1+ I J(Q, 0) +--+), (3.8) 


where J is some sum of divergent integrals. (That is, J depends 
sensitively on the regularization parameter Mav, and goes to infinity 
as Mav — oo.) For two-particle scattering, the power P would be 2, 
but more generally it could take other values and “0” could represent 
a set of several dimensionless variables. Converting from gp to ag = 
9%/(4n), and removing an overall factor, we can define a physical 
quantity R in the general form 


R =a (1+ag J(Q,0) +), (3.9) 


with J = 47? J. 

Renormalizing this result is simply a matter of eliminating apg 
in favour of ag using Eq. (3.7). We spell out the simple algebra 
explicitly: 


= (ag(1 — ar I(u) +--+)? (1+ar J(Q,0) +) 
=aR(l — Parl (u) +--:)(1+ar J(Q,0)+---) (310) 
arn +ar(J(Q,8) =P]: 
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The “+---” terms all have two or more factors of ar relative 
to the 1, so that formally they are “O(a}).” What happens, in 
a renormalizable theory, is that the divergences in J(Q,0) cancel 
with those of PI(u). Thus, we may now take the limit where the 
regularization goes away to obtain a finite coefficient 


rı =71(Q,9,u) = lim [J(Q,6) — PI(p)]. (3.11) 


Muy oo 


Similar cancellations occur in higher orders, so that the result 
appears as a power series in a = ag that has finite coefficients: 


R=a’(1+riat-::). (3.12) 


Here we have dropped the R subscript, leaving it understood that a 
stands for the (renormalized) couplant g? /(47?). We use the word 
“couplant” because the phrase “renormalized coupling constant” is 
both cumbersome and misleading — a is not a constant; it is a 
function of the renormalization scale u, and of the “other choices” 
made in its definition, as indicated in Eq. (3.3). 


The proof, first for QED, and then for QCD and other non- 
Abelian gauge theories, that the needed cancellations do indeed 
occur to all orders is, of course, a highly technical matter. It repre- 
sents a triumph of theoretical physics, to which many distinguished 
physicists made important contributions. Here, we simply rely on 
their result that, in the theories of interest, the needed cancellations 
do occur. 


3.5. The Renormalized Result and Its Ambiguities 


The coefficient rı in the last equation depends not only on the 
physical variables Q, 6, as we would expect, but also on u. However, 
the physical quantity R must ultimately depend only on the physical 
variables Q,0: it cannot depend on u and the other arbitrary 
choices, as is evident from the original “bare” form Eq. (3.9). In 
the renormalized result, what happens is that the u dependence in 
the coefficients r; cancels with the u dependence from the couplant a. 
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That fact is expressed by the RG equation 


Ka 
ET 


where (a) = uda/dp. (Note the slight change in the normalization 
of 6 relative Eq. (2.39) of the previous chapter.) Here the first term 
picks up the u dependence via the r; coefficients while the second 


+ ata) 5.) R=0, (3.13) 


a 


term picks up the u dependence via a. 

From the discussion in the previous chapter, we could have 
anticipated that the answer would appear in a form that seemingly 
involves an arbitrary scale u. If we could solve the theory exactly, 
there would be no problem, since the exact R is exactly independent 
of u. The difficulty arises because, in practice, we can calculate 
only a finite number of terms in the perturbation series. Since 
the p-dependence cancellations occur across different orders, they 
are spoiled when we truncate the series, leaving the approximant 
dependent on p. 

The same point applies to the “other choices” involved in the 
definition of gr. The couplant a and the perturbative coefficients 
ri will each depend on these choices, but the exact result cannot. 
If we had made different choices when defining gr, our renormalized 
couplant, a’, would have been different; instead of Eq. (3.6) we would 
have had 


a’ =ap(l+apl'+---), (3.14) 


where J’ has the same divergent part as J, but a different finite part. 
Thus, eliminating ag to express a’ in terms of a, we would find 


a’ =a(1+uja+-:-), (3.15) 


where vı = I' — I is finite, but otherwise arbitrary. 
If we had used the primed RS in the calculation of R we would 
have obtained a result 


R=a'(1+ria'+---), (3.16) 
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where r{ is a different finite coefficient. It is given by r| = rı — Pv, 
as is easily seen: 


R=" (triad +) 
=(a(1+va+:--))? (1+ria+--) (3.17) 
=al (1+ (ri +Pujat:-), 


so that rı must be ri + Pv}. 

A basic issue, that can cause much confusion, is the following. A 
change in u affects the second-order coefficient r1, but a change of 
the “other choices” — the RP — can do so, too. However, there is 
only one degree-of-freedom at this order, corresponding to the single 
arbitrary coefficient vı in Eq. (3.15). The resolution of this issue can 
only be properly explained later on. 


In Chapter 6, when we have properly defined the A parameter 
that specifies the boundary condition to the 8 function, we will 
find that A is RP dependent — in just the right way to make 
rı dependent on RS only through the ratio p/A. Any “other 
choices” that leave this ratio unchanged will only affect the higher 
coefficients r2,.... Chapter 7 will explain how to parametrize those 
“other choices.” 


3.6. The Cancellation of Divergences, Revisited 


Let us revisit Eq. (3.6), the expression for the renormalized couplant 
in terms of the bare one: 


a=ap(l+apl(p)+---). (3.18) 


and ask how it can be that the divergent integral J(u) depends 
on a mass-scale u. The issue is starkest in a massless theory, such 
as massless QCD, where there is no dimensionful parameter in the 
theory’s Lagrangian: What is there to form a ratio of u with? The 
answer is that J(u) is a divergent integral and can only be given 
a meaning by “regularizing” the theory, and regularization always 
introduces a scale — the cutoff Mav. In an actual Feynman-diagram 
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calculation, one would find that J(u) has the following form, for 
large Mw: 


I(u) = bln(Ma/u) + w + negl. (3.19) 


Here b and w are finite coefficients that are independent of u, and 
“negl.” are negligible terms that vanish as My, — oo. Note that 
there is a logarithmic divergence as Myy — co, and the argument of 
the logarithm is necessarily a dimensionless ratio of Muay with p. 

The fact that the u dependence of J is logarithmic could have 
been anticipated from Eq. (3.11), where rı = J(Q,6) — PI (u) must 
end up as a function of the ratio of Q/j once My, has been taken to 
infinity. The key cancellation is In(Myy/w) — In(Muy/Q) = In(Q/p). 
Hence, we can predict the form of J(Q,6) to be 


J(Q, 0) = PbIn(My, /Q) + j(0) + neg!’ (3.20) 


The coefficient b above is, in fact, the leading coefficient of the 8 
function, as we can quickly see. Differentiating Eq. (3.19) gives 


dI 
Be (3.21) 
so that differentiating Eq. (3.18) gives 
_ da 2 al 3 
Bla) = aT aB ET + O(ap) 
= —ba?;(1 + O(apg)) (3.22) 
= —ba?°(1 + O(a)), (3.23) 


where the last step uses ag = a(1 + O(a)) to eliminate ag in favour 
of a, so that no reference to bare quantities remains. In higher orders 
we would find a perturbation series for £: 


B(a) = —ba?(1 + ca + cpa? + cza? +--+), (3.24) 


with finite coefficients. The 6 function represents the “anomalous” u 
dependence of the renormalized couplant arising from the presence 
of ultraviolet divergences in the unrenormalized theory. 
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3.7. Dimensional Regularization 


We have so far framed our discussion in terms of cutoff-type proce- 
dures. Those regularization methods have the conceptual advantage, 
for our purposes, of cleanly separating the regularization procedure, 
and its technicalities, from the RS — the definition of gr and 
hence of a — which can be specified by “renormalization conditions” 
independent of the particular regularization. However, there is no 
doubt that for actual calculations, dimensional regularization, and 
its variants, are far more convenient. 

Dimensional regularization considers a generalization of the 
theory to a non-integer space-time dimension d = 4 — e. (This 
generalization is by no means unique, and a whole set of rather arbi- 
trary conventions must be specified.) Removing the regularization 
corresponds to taking the limit € — 0. The divergences — that in 
a cutoff regularization appeared as logarithms of the cutoff Mav — 
now appear as poles in L, The unrenormalized perturbation series 
for a physical quantity has the form 


R R R 
R=ag (14 (FB + mo) an + (B+ + fn) abe), 
(3.25) 


where the Rj;’s are various coefficients. (R11 is b, in fact.) 

At first sight the fact that the regularization parameter is now 
the dimensionless € would seem to undermine our argument, at the 
start of the preceding section, that regularization always introduces 
a mass-scale. However, a mass-scale really is involved because in d 
dimensions the bare coupling constant gg in the Lagrangian has 


—d/24+2 


dimensions [mass] = [mass]‘/?. Hence, if the renormalized 


couplant, a, is to be dimensionless we will need to write 
ap =poa(l—Tla+---), (3.26) 


where uo is some “unit of mass.” In the same way, in any dimensional- 
regularization calculation one repeatedly makes the small-e expan- 
sion of kf, where k is some momentum or other massive variable, as 


“Ke =1+elmnk+ O(e*).” (3.27) 
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While this is perhaps mathematically valid, it makes no dimensional 
sense, and is really 


k® = mC + €ln(k/po) + O(€*)), (3.28) 


where, again, uo is some “unit of mass.” If one defines a() 
by some regularization-independent renormalization conditions, the 
I() integral becomes a function of the ratio u/uo, while J(Q,6@) 
becomes a function of Q/uo, with uo cancelling out in ry. 

However, in the well-known renormalization scheme called “min- 
imal subtraction” (MS) the story is a bit different. The MS scheme 
corresponds to calculating physical quantities, such as ø, in dimen- 
sional regularization (with some very specific choices of the details) 
and then simply discarding the pole terms L, +, .... While this might 
seem to be a blind disregard of infinitely large quantities, it is in 
fact equivalent to an ag-to-a substitution with a definition of a that 
involves only the pole terms, with no finite parts. An even more 
popular scheme, “modified minimal subtraction” or MS, subtracts 
the (ln 4r — yg) terms, where yg is Euler’s constant, that naturally 
accompany each 1 pole. (This procedure could be recast as a minimal 
subtraction in a modified dimensional regularization with a different 
convention about the generalization to d dimensions.) 

In such schemes the definition of the RS is entwined with the 
specification of the regularization procedure. There is nothing wrong 
with that, provided the conceptual distinction between regularization 
(which is removed in the end) and renormalization (the definition of 
the couplant) is kept in mind. One may dispense with the distinction 
between uo and pu. 

In the minimal subtraction framework, as ’t Hooft has shown, 
there is a strong formal argument that the bare coupling constant 
is infinitesimal, of order e. (See Exercise 3.2.) The result can be 
expressed as 


ap = 15> (1+e+m= + O(0)) . (3.29) 


Note that the scale parameter jig is really unspecified because 
changing it by a finite factor only affects the O(c) correction term. 
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We can then understand how the apparently nonsensical “bare” 
series, Eq. (3.25), with its divergent series coefficients can actually 
be hiding a finite result. Indeed, the bare expansion is akin to the 
extreme limit of a very bad RS where the couplant is far too small, 
so that the series coefficients are far too big. 


3.8. Perturbation Theory, the Cutoff — co Limit, 
and Asymptotic Freedom 


Perturbation theory in a QFT involves two limits. One is the limit 
in which the regularization is removed; that is, € — 0 in dimensional 
regularization, or May — œ in cutoff-type regularizations. The other 
is the perturbative limit in which, formally, a — 0. We should 
perhaps ask if these two limits commute or not. 

The answer, we would argue, depends on whether the theory 
is asymptotically free or not; that is whether the coefficient b in 
b(a) = —ba? +--+ is positive or not. 

As we saw in Sec. 2.8 the leading-order form of the effective 
couplant is 


a 3.30 

o~ FMON se 

Thus, for an asymptotically-free theory the good region for pertur- 
bation theory is at large energies, Q >> A. The formal perturbative 
limit, a — 0, corresponds to pretending that A — 0, so that 
perturbation theory is then good at any Q. The two limits can be 
expected to commute since the scales A and My, and are being sent 
off in different directions; to 0 and oo, respectively. See Fig. 3.3(a). 
In non-asymptotically-free theories, however, the good region 
for perturbation theory is at low energies, Q < A. (Recall that 
a is proportional to g? and should be positive, as well as small.) 
Thus, here the perturbative limit corresponds to sending both My, 
and A to oo. See Fig. 3.3(b). For the “real” theory we should 
send My, — oo first, but perturbation theory takes A — oo 
first. Consequently, perturbation theory in non-asymptotically-free 
theories is problematic. Indeed for \¢* theory and QED (viewed as 
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pi Q - Q 
(a) (b) 
Fig. 3.3. The leading-order effective couplant in (a) an asymptotically-free 
theory, and (b) a non-asymptotically-free theory. Perturbation theory is good 
in the unshaded region and becomes good at all Q in the limit that the A scale 
is taken to (a) zero or (b) infinity. In case (b) this limit conflicts with the cutoff 


to infinity limit, making perturbation theory dubious in non-asymptotically-free 
theories. 


a self-contained theory) the “real” theory is believed to be “trivial” 
in the technical sense that it can have no interactions. 

The present general attitude to perturbation theory in non- 
asymptotically-free theories is that it is valid provided that such 
theories are viewed as “effective low-energy theories” that approx- 
imate some unknown, grander theory with quite different ultraviolet 
behaviour that sets in at some finite, but very large, physical energy 
scale Mav. The regularization is then not some mere technical device, 
but is a crude representation of some actual physics at very high 
energies. (In terms of our “scaffolding” analogy, the scaffolding here 
is load-bearing, and it can’t be entirely removed without causing the 
collapse of the whole structure.) Perturbation theory in this effective 
theory is meaningful provided that the A scale is much greater than 
the Muay scale. Of course, the uncertainties in what is the right 
regularization procedure, which now physically matters, are present 
alongside RS ambiguities in such theories. This book is principally 
concerned with asymptotically-free theories, where those issues do 
not arise. 


3.9. “RG-improved” Perturbation Theory and the 
RS-dependence Problem 


Returning to the renormalized result for the physical quantity R: 


R=aF(1+riat+--:), (3.31) 
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we now have a series with finite coefficients in a finite expansion 
parameter, a. However, the triumph of renormalization in rendering 
the result manifestly finite is tarnished by the fact that all the 
series coefficients r; are ambiguous and completely dependent on 
the arbitrary RS choices. That would not matter if all orders could 
somehow be calculated and re-summed, because R is RS invariant. 
However, there is a serious RS-dependence problem when we truncate 
the series, as we must in practice. 

As noted earlier, the coefficient rı depends on u/Q through a 
term bln(u/Q). Thus, rı will have a very large magnitude if the 
physical scale Q is much, much larger (or much, much smaller) 
than the renormalization scale u. The apparent convergence of the 
perturbation series would then be very bad. Because of this “large 
logarithm” problem, it is a bad idea to fix u once-and-for-all; rather, 
it should be allowed to “run” with Q in some fashion. Simply 
setting u = Q leads to “RG-improved” perturbation theory, an idea 
originating with Gell Mann and Low and developed and exploited 
by the pioneers of QCD. 

It is undoubtedly a major improvement — and was hugely 
important to the development of physics — but it does not resolve 
the ambiguities. Indeed, the idea of avoiding “large logarithms” only 
goes so far. Why should we set u = Q, rather than, say u = 2Q, 
or u = Q/4, etc.? Indeed, what, precisely, is “Q”? We have only 
said that Q is some kinematic variable associated with the physical 
quantity R, but which is the “right” one? Equally, we have said 
only that the renormalization scale u is some parameter, with the 
dimensions of mass, introduced in the renormalization procedure. 
If one tries to address, separately, these three issues — what is 
the “right” Q, the “right” u, and the “right” relationship between 
them — one finds oneself sinking into a quagmire of ambiguities. 
This is not a useful way to approach the RS-dependence problem. 

Moreover, the question of the “other choices” — the RP — would 
still remain. A frequent suggestion is to use a physical quantity to 
define a “physical” RP: Take some process A with a normalized phys- 
ical quantity Ra =aP(1+---) and define a4 = R: One can then 
make predictions for another process B as a truncated perturbation 
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series in the “physical couplant,” a4. This is a superficially attractive 
idea, but it does not survive scrutiny. Why should we choose process 
A as the special one? Why not processes C, D, E,...? One is still 
left with an infinite number of differing predictions for process B. 
This is just the RS-dependence problem all over again: nothing has 
been resolved. (To say nothing of the issue raised above: How is the 
renormalization scale for process B to be related to the “Q” of process 
A?) The definition of a, is just a way of specifying a RP. Indeed, 
” 


the phrase “physical couplant” or “scheme-invariant couplant” is a 


contradiction in terms. 


An analogy with Lorentz invariance may help here. Energy, 
by its nature, is not an invariant quantity; it transforms from 
one reference frame to another. The rest mass of a particle is 
Lorentz invariant, and can be calculated from the energy E and 
3-momentum p of the particle in any frame, as the combination 
\/ E? — p°. Now, there is a particular frame, the rest frame, in 
which p = 0 and E = m. Have we then discovered a “Lorentz- 
invariant energy”? Of course not: we have just defined a particular 
frame. In the same way, a couplant, by its nature, transforms 
under changes of RP. The definition of a4 above does not define a 
“scheme-invariant couplant” — it defines a RP. 


It is often tacitly assumed that the RP should be fixed once-and- 
for-all. In our view, that is a bad mistake. Just as u should “run” 
with the energy scale Q, so the RP should be free to depend on the 
particular quantity R being calculated. This point should become 
clearer in the next chapter. 

The reader may well find this section rather confusing and 
unsatisfactory. Indeed, its purpose is partly to give a flavour of 
the fog of confusion that too often surrounds the subject. The RS- 
dependence problem can only be discussed more productively when 
some fundamental matters — discussed in Chapters 6 and 7 — have 
been explained. 

All these issues come into much better focus when RS-dependent 
perturbation theory is seen as a “non-invariant approximation.” The 
quantum field theory itself does not have a problem — its exact 
results are RS independent. Thus, the answer is not to be found in 
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the theory. The problem is with the approximation: The exact result 
is independent of the various RS variables, but the approximant 
depends upon those variables. Such “non-invariant approximations” 
actually occur in a wide variety of contexts, and much can be learned 
from some simple examples. That is the topic of the next chapter. 


Appendix 3.A: Renormalization in “Counterterm” 
Language 


Renormalization is more usually described in “counterterm” lan- 
guage. For a renormalizable theory this just means that we make the 
reparametrization step in the Lagrangian from the start. To illustrate 
in the simplest case we consider a massless scalar theory with a + 
interaction: 


1 
L= 5 LOB "$B — ABop. (3A.1) 


This bare Lagrangian is written in terms of the bare field and bare 
coupling constant. (Ag is the counterpart to ag in the QCD case.) 
We now substitute for these in terms of a renormalized field and 
coupling constant: 


dp =Z¢, ABSZ BiA (3A.2) 


Here the Z’s are called renormalization constants and they will have 
a perturbation expansion of the generic from 


Z=1tzrA+ ad? +e, (3A.3) 


where the 2, 22,... coefficients are divergent integrals. With this 
substitution the Lagrangian becomes 


1 
L= 5% 3p OW — Z1A¢", (3A.4) 
which we may then write as 
1 
L= 5 „o ON — Ao* + counterterms. (3A.5) 


The counterterms have the same form as the terms in the “renor- 
malized Lagrangian” but multiplied by (Zs, — 1) and (Zı — 1) 
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factors, respectively, which start at order A. From here on we 
need never mention the bare quantities. We may proceed to do 
Feynman-diagram perturbation theory treating both the A¢* term 
and the counterterms as perturbations. The counterterm coefficients 
21, 22,... are to be fixed so that they cancel the divergences. (Of 
course, that only fixes the divergent parts of the z;’s; the freedom to 
chose the finite parts corresponds to the RS ambiguity.) 

This formalism has advantages in organizing the calculation. 
Green’s functions with n legs renormalize by 


GO) (pi, ,n) = 25"? GY (i, AB, Mun); (34.6) 


while proper vertices (1-particle irreducible amplitudes) renormalize 
with a oe ? factor. For physical quantities the field normalization 


does not matter and only the composite Z = Z5 Z in Ap = ZX 
matters. (The counterterm formalism also has the advantage of 
generalizing to non-renormalizable theories: There the counterterms 
are not of the same form as the terms in the Lagrangian, and more 
and more are needed in higher orders.) 


Exercise 3.1. Generalize the equations of Secs. 3.4-3.6 keeping 
two orders of correction terms and show that one can expect both 
In?(Myy/p) and In(Muy/u) divergences, which — assuming the 
needed cancellations do happen — will leave In?(Q/j) and In(Q/,) 
terms in the second-order coefficient r2. 

Show also that the second coefficient of the 8 function can be 
found from the coefficient of the subleading In(M,,,/j4) divergence 
of the 2-loop term. 


Exercise 3.2. This exercise follows the analysis of ’t Hooft. In the 
minimal-subtraction scheme, if we group terms by powers of 1/e, 
rather than by powers of a, the expression for the bare couplant in 
terms of the renormalized couplant is 


sara 


Here ag is pi-independent, while use will be (a) plus corrections 


of order e. Multiply through by u~* and take u d/dp of both sides. 
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Then equate powers of € to show that 
da 
"ap 
and that the A;(a)’s are related to 8(a) by 
bla) = aA} — Aj, 
B(a)A;, = aAj 4 —Ajii1, g=1,2,.... 


= B(a) — «a, 


Show that these results are equivalent to the equation 


dap 
da 


Integrate this differential equation to show that 


aape -e f'a 1 -[% 
PETH BAR mE 5 “Bla) — ex 5 «vi? 


with ô > 0. Alternatively, writing (x) as —ba? B(x), one may 
express the result as 


— e "m BO) 
an = p'ap ( 7i ay) 


Hence, noting that B(x) ~ 1/(1 — cx) will be a sufficient 
approximation, obtain 


(Bla) — ca) = —eapg. 


an= $ (1+c%mé+0(0); 


(Note that € and b need to have the same sign for ag to be positive.) 
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Chapter 4 


Non-invariant Approximations and the 
Principle of Minimal Sensitivity 


4.1. Approximations and Series Expansions 


Before turning to non-invariant approximations specifically, some 
general consideration of approximations may be warranted. The 
discussion may seem insultingly elementary and our remarks rather 
trite, but we hope at least to convey that there are issues to be 
thought about. Theorists learn the mathematics of limits, asymp- 
totics, conditions for convergence of series, etc., but these theorems, 
while immensely important, are not quite the same as the question 
of what makes a good approximation. 

For instance, proving that the asymptotic behaviour of some 
function A(Q) as Q > œ is A(Q) ~ InQ does not really answer 
the question What is a good approximation to A(Q) when the energy 
Q is some specific, large value? To answer the second question one 
must decide, at least roughly, what reference scale Qo belongs in 
the argument Q/Qp of the logarithm, which fundamentally must be 
dimensionless. The choice of Qo is irrelevant to the mathematical 
limit, but it matters quite a lot for the practical question. Similarly, 
answering the mathematical question of whether a series converges 
does not answer the question of whether, say, the first three terms of 
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the series are likely to give a good approximation. Indeed, the second 
question may have very little to do with the first. 

Approximations arise in many ways, but the justification always 
traces back to the fact that some dimensionless parameter is small. 
There is a limit — though it may or may not be realizable 
physically — in which the parameter tends to zero and the approx- 
imation becomes exact. In many cases the approximation can be 
systematically improved and we find ourselves calculating a quantity 
f(x) as a power series, 


f(z) = fot fia + for? + for? t, (4.1) 


with our successive approximations being successive truncations of 
this infinite series. (Not all problems lead to such a form — the series 
could involve logarithms of x, or still more exotic apparitions — but, 
for simplicity, we will not discuss such cases.) When x is small we 
can hope that fo is a crude approximation to f(x), and that fo+ fix 
is a better one, with fo + fiz + fox? being better still, and so on. 

A key mathematical issue is whether the series is convergent; that 
is, whether the partial sums }°"" , fix’ converge to a limit as n > 
oo. Convergent series converge in a circular region of the complex 
x plane, for | x |< £e, where ze is the radius of convergence. Even 
if the series is not convergent it may be asymptotic. A prototypical 
example is the alternating factorial series: 


f(a) = a(1 — La + Qn? — Bla? + diet +---). (4.2) 


It has radius of convergence zero, since — no matter how small 
x is — the partial sum eventually becomes dominated by the last 
term added, with the result then alternating violently between large 
positive and negative values. However, for x small, the partial sums 
initially appear to converge towards a constant value (around 0.9156 
for x = 0.1, for example). Thus we can often get useful, even 
remarkably accurate, approximations from divergent series. 

It is important to distinguish two questions: Is the series 
convergent? (or, more generally, is it summable by such-and-such 
a method?) and Is the sum of the series the right answer? The 
first question depends only on the series, but the second cannot be 
addressed without knowing where the series comes from. 
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For example, consider the following fable. Suppose that the exact 
answer to some physical problem is given by f(x) at x = E, and 
it is known that f(x) ~ x as x —> 0. A complicated perturbative 
method is devised to calculate f(x) as a power series. After long 


calculations the first few terms are found to be 
f=zr(l-r+r? +---). 


A clever theorist then proves that the series coefficients, to all 
orders, are +1 in alternation. Resumming the series he then obtains 


f 


o 2 
~ 142’ 


which gives f (4) = 0.167. Sadly, though, this result disagrees 
strongly with the experimental result, 0.433, even though the series 
is convergent, and x = + is well within the radius of convergence. 

Another clever theorist then establishes that f(x) satisfies the 
differential equation 

d d 
z?(1+ nos —a(1— 302) +(1-—2)f =0, 

and that for large x there is another boundary condition requiring 
f 31447? as x > œœ. Thus, the actual answer is 


x 5 1 
f(z) = tpa 7 er, 
and the mystery is solved. 

In fact the equal signs in the earlier equations were quite 
inappropriate: The first was not f but the series generated from 
it, and the second was not f but the sum of that (convergent) 
series. The actual f has a large non-perturbative term, invisible 
to perturbation theory (the power series expansion of eT? is 
0+0+0+---). The moral of our fable is that the possible 
existence of such terms is an issue, irrespective of whether the series 
is convergent or divergent. Note that there is simply no information 
in the coefficients of the series about the non-perturbative term. 
No clever resummation method could have led, except by complete 
accident, to the true answer. 


For divergent series, the question of how to sum the series and 
the possibility of non-perturbative terms are somewhat intertwined, 
since there can be a natural sense in which non-perturbative terms 
arise from the needed resummation procedure, their form being in 
a sense implied by the rate of growth of the coefficients. However, 
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there are still two distinct issues, since there can always be non- 
perturbative terms that are just “tacked on” and about which the 
series coefficients know nothing. Thus, a proper discussion must 
start, not from the series, but from the mathematical problem 
(differential equation, functional integral, or whatever) from which 
the series arose. (See also Exercise 4.1.) Such issues will be relevant 
in Chapter 5. For now we focus more on approximations obtained 
from just a few terms of a series. 


4.2. Approximology 


Approximations are a gamble. In physics, as in life, we have to act 
upon incomplete information. Except in rare instances where rigorous 
inequalities can be proved, we can say nothing with certainty. 
Nevertheless, our duty is to obtain the best approximation we can, 
given limited time and resources. We should be clear, though, we are 
making a gamble. 

There are, however, good gambles and bad gambles. The guiding 
principle must be to make good use of any information we have. One 
should not bet on an outcome that is glaringly inconsistent with 
some known information. Even the simplest facts can be powerful 
information. 


For example, consider the classical ballistics problem of the range 
of a projectile with air resistance proportional to velocity. The 
projectile, mass m, is launched with a muzzle velocity vp at an angle 
of elevation a. Neglecting air resistance the range is Ro = 2UV/g 
where U = vocosa and V = vosina are the horizontal and 
vertical components of the initial velocity. Allowing for a drag force 
F = —Kv as a first-order perturbation gives an approximant to 
the range as 


Rx Ro (1 = 2K) 5 
where the dimensionless small parameter is 


2KV 
K = — 


mg 


This provides a satisfactory approximation only when « is much, 
much less than 1. Indeed, we can see that the approximant absurdly 
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predicts a negative range if k > 3. The fact that the exact result 
must be positive definite should be made use of. A simple way to 
do so is to modify the form of our approximant to 


Ro 
Ra 
=D) 


which leads to a much more robust approximation. This a simple 
example of a Padé approximant, which in general is the ratio of 
two polynomials in the small parameter. Padé approximants are 
not a universal panacea, but in this case there is a clear reason for 
preferring the Padé form. 


Approximations are never without uncertainties and ambiguities. 
There are always two questions to be decided: Which quantity do we 
want to calculate? and What form of approximant will we use? The 
two questions are interlinked. For example, in the ballistics example 
above the choice of the Padé approximant form is equivalent to 
choosing to calculate the reciprocal of the range, rather than the 
range itself, as a power series. Another example is the following. 
Suppose a quantity f has a power series fo + fix + fox? +--+, but 
there are reasons for thinking that f? is the more natural quantity. 
A second-order approximation to f? would be equivalent to using an 
approximant to f of the form (f?+2fofir+(2fofa + f2)a?)!/? rather 
than fo + fix + fox?. Similarly, if a quantity f is naturally the ratio 
g/h of two other quantities, do we want to calculate f or to calculate 
g and h separately? This matters because the approximation to f is, 
in general, not exactly the same as the ratio of the approximations 
to g and h. 

It is hard to give any general guidance on these issues, beyond 
saying that one should calculate what seems most natural and is 
most directly measured in experiments. The general principle should 
always be to make use of all information available in any specific case. 

Discipline is needed when making approximations. If a calcula- 
tion involves more than one approximation we should carefully dis- 
tinguish the primary approximation — which is the unavoidable step 
and the main source of error — from secondary approximations that 
might be made just for convenience. Such secondary approximations 
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are fine provided that they cannot contribute significantly to the 
overall error. 


In QCD, for example, the primary approximation is the truncation 
of the perturbation series. A common practice is to use a secondary 
approximation where the couplant is approximated by a truncated 
series in 1/In(Q/A). However that introduces uncontrolled errors 
and unnecessary ambiguities, as will be discussed in Chapter 6. 


It is always interesting to investigate approximations beyond 
their comfort zone. If an approximant becomes manifestly unphysical 
beyond some point then perhaps we are not making best use of 
our known information, and some modification would cure, or at 
least mitigate, the problem — as we saw in the ballistics example 
above. Even if the region of applicability of the approximation is 
intrinsically and unavoidably limited, it is good to know how and why 
it breaks down, if only to better appreciate its likely uncertainties in 
the intermediate region. 


For instance, QCD perturbation theory is good at high energies, 
but is it necessarily useless at low energies? The effective couplant 
is guaranteed to be small when Q > A, but, in some cases 
at least, it may remain small even when Q/A is small. That 
opens up the possibility — modulo non-perturbative terms — that 
perturbation theory may have something useful to say about low 
energies. (Note that a secondary approximation of re-expanding in 
powers of 1/In(Q/A) would spoil any such possibility.) This issue 
is intertwined with RS dependence, of course, so it is too early to 
discuss it further. 


4.3. The Principle of Minimal Sensitivity 


“Non-invariant approximations” are the focus of the remainder of the 
chapter. These are approximations where the results depend on some 
“extraneous” parameter(s) that we know the exact result cannot 
depend on. Non-invariant approximations can arise in various ways. 
Suppose — in the days before computers — a physicist needed to 
accurately evaluate an integral on the range 0 to 1, with an integrand 
that was singular at the bottom limit, behaving as, say, 1/,/z, as 
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x — 0, but otherwise smooth. He or she might naturally make an 
asymptotic expansion about x = 0 and integrate that analytically 
up to some small value x9, and then add the result to a numerical 
integration from x9 to 1. But, what value of xo should be used? The 
answer is, of course, “it should not matter.” However, while it should 
not matter, the uncomfortable fact is that it does. Of course, if one 
needed only two-decimal-place accuracy and the results, for a few 
plausible values of xg only differed in the fifth decimal place, then 
one would probably be quite happy. But if it were very important 
to get the best possible accuracy then it would make sense to think 
carefully about what xo is optimal. Our argument is that, in the 
absence of other information, the best choice for xo is where the result 
is least sensitive to small variations in x9. We call this the “Principle 
of Minimal Sensitivity.” 


If an approximant depends on “extraneous” parameters, then — 
in the absence of further information — their values should be 
chosen so as to minimize the sensitivity of the approximant to small 
variations in those parameters. 


We mean this just as an explicit statement of a piece of common 
sense. It is a notion that has no doubt been employed many times 
in various specific contexts without much fanfare. By calling it a 
“principle” we do not mean to suggest that it is on a par with, 
say, the Principle of Least Action, but merely to emphasize its great 
generality. We regard it almost as a moral principle: “things should 
be independent of what things should be independent of.” 

There is no theorem here. We are talking about what is the best 
gamble to make, given all that we know. The fact that we know 
that the exact result is independent of xg is a valuable piece of 
information, and ought to be made use of. One could hardly believe 
that the approximation was a good one if it varied a lot under a small 
change of zo. Our point is that where the approximate result is least 
sensitive to small variations of £o is where it is most believable. (See 
Fig. 4.1.) 

Of course, the argument is qualified by the phrase “in the absence 
of further information,” since in rare cases we might know a specific 


64 Renormalized Perturbation Theory and Its Optimization 

7 

6 

5 

4 

3 

2 

1 

0 Xo Xo 
0.0 0.5 1.0 1.5 2.0 0.0 0.5 1.0 1.5 2.0 


(a) (b) 


Fig. 4.1. (a) A non-invariant approximation gives a result that is a function of 
some extraneous parameter xo. (b) We also know that the exact result, whatever 
its value may be, is independent of xo. In what sense is (a) a good approximation 
to (b)? 


fact that definitely indicated a different choice for the extraneous 
parameter. (Even in such cases, we would argue, it is likely that the 
alternative choice is quite close to the minimal-sensitivity choice, and 
makes only an insignificant difference.) In many cases it is possible 
to make intuitive arguments for a good choice of the extraneous 
parameter — an example is in Fig. 4.4 — but these arguments are 
generally very “fuzzy” and merely corroborate, and provide some 
insight into, the PMS choice. 


It is possible, of course, for the approximant to have more than 
one stationary point, or none. That issue does not arise in the RS- 
dependence problem, but it does in other applications of PMS. It 
will be discussed when it arises in some of the examples below. 


To better understand the above arguments, and to see how non- 
invariant approximations work, one can learn a lot from some simple 
examples. 


4.4. Quartic Oscillator Example: First-Order 
Approximation 


Consider the classic quantum-mechanical problem of computing the 
eigenvalues, En, of an anharmonic oscillator. For maximal simplicity, 
we specialize to an oscillator with a purely quartic potential. (The 
general case, with both x? and zt terms in the potential will be 
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discussed later in Sec. 4.8.) The Hamiltonian is 
H = ip? + dat, (4.3) 


where x and p are operators satisfying the commutation relation 
[z,p] = i. Trying to treat the Arf term as a “perturbation” is 
quite hopeless — the unperturbed Hamiltonian would then be just 
ip’, which has a continuous spectrum, qualitatively unlike the 
discrete spectrum of H. However, following the work of Caswell and 
Killingbeck, we may add and subtract an 2? term and write H as 
Ho + Aint with 


Ho = 3 (p + Rr?) ; Hint = —10? 2? + Azt. (4.4) 


Standard quantum-mechanical perturbation theory can now be 
applied, generating what we call the Caswell-Killingbeck (CK) 
expansion (it is also known as the “linear 6 expansion” ). 

Note that an “extraneous” parameter, Q, has been introduced. 
It is arbitrary, in the sense that the exact eigenvalues of H clearly 
do not depend on Q. However, our approximate results at any finite 
order will depend on Q; hence we describe the approximation as being 
“non-invariant.” 


The calculation is most easily done by introducing raising 
and lowering operators, a!,a, for a simple-harmonic oscillator of 
frequency 2. These have the commutation relation [a, at] = 1 and 
are related to the x, p operators by 


1 . JIQ 
aan p=-iy/ Baal) 


so that Ho = Q(ata + 4). The unperturbed eigenvalues are 


— 


EQ) = (n+ 4)0. 
The unperturbed states are given by 
+? 
a 
ny? = S10), 


n! 


and satisfy 


al |n)\© = yn + ijn ++ 1), 
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Straightforward calculations using the commutation relations yield 


(nl(a-+at)?|n)® = 2n+1, 
 (ni(a + al)4|n)© = 3(2n? + 2n +1). 


The first-order correction term for the nth eigenvalue is 


I 3A 
RP = (n|Hineln) = —5 (n+ 4) 0+ ACn + 2n+1). (4.5) 
Adding this correction to the zeroth-order term gives the first-order 
result: 


Ep? = BO) + EP = s(n +4) + (on? +2n +1). (46) 
Note that the first term has a factor 5 in front arising from a partial 
cancellation, 1— ż, between the zeroth-order term and the part of the 
first-order correction produced by the —10? 7? piece of Hin. Note 
also that Æ: can be written as © (n|Ho|n)® + © (n|Hing|n)© = 
((n|H|n), so that the first-order perturbative result can also be 
viewed as a variational estimate using the trial state |n)©. 

The result (4.6) naturally depends on the physical variable n, 
the quantum number labelling the eigenstates, but it also depends 
on the extraneous variable Q. Because of this unphysical dependence 
the result has no quantitative meaning unless and until we decide 
how Q is to be chosen. A plot of E! against Q, Fig. 4.2, shows 
that taking Q too big or too small is clearly bad; the result becomes 
infinitely large in either limit. But how, in the absence of further 
information, are we to find the “right” value of Q? (And is it the 
same for all energy levels, or does it depend on n?) 

We do have one piece of information, however: We know that 
the exact nth eigenvalue, En, is independent of Q. That is surely 
a valuable piece of information, and ought to be made use of. 
The Principle-of-Minimal-Sensitivity (PMS) argument is that it is 
sensible to choose Q to be in a region where the approximant is 
insensitive to small variations of Q. (Both Caswell and Killingbeck 
independently made this argument.) The approximant, Æ}, is 
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Fig. 4.2. Results for the low-lying energy levels Eo, £1, E2 to first order in the 
CK expansion, as a function of the extraneous variable Q, in units of Me: 
minimally sensitive to Q at the stationary point where 


dE 
2 =Q. 4. 
dQ j (=a 


This is the “minimal sensitivity” or “optimization” condition. Its 
solution yields the “optimal” value of the Q parameter — which 
indeed depends on which energy level we are calculating. 

Applied to Eq. (4.6) the PMS optimization condition leads to the 
equation 


1 oe 
5 (n+ 3) + (-2)2 (2n? + 2n +1) =0, (4.8) 


whose solution is 


L 
2n? + 2n +1)]3 


a [o,f 
= | (4.9) 


(Quite generally, we shall use an overbar to denote an “optimized” 
value.) Substituting this value into Eq. (4.6) yields the “optimized” 
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Table 4.1. The energy levels En of the quartic 
oscillator in units of \1/°. The approximate eigen- 
values E7°* from first order in the CK expansion, 
optimized by the PMS criterion, are compared 
with the essentially exact results from Hioe et al. 


n E:S peat Error 
0 0.68142 0.667986 2.01% 
1 2.42374 2.39364 1.26% 
2 4.68500 4.69680 —0.25% 
3 7.29111 7.33573 —0.61% 
10 31.3587 31.6595 —0.95% 
n — œ (1.36284) (1.37651) —0.99% 


x(nt a) x(n) 


result: 


1 
(2n? + 2n+1)]3 


= 3 
Bt = “(pn +2) [3A 


’ (4.10) 


If we compare with precise values from the literature (see Table 4.1), 
we find that this simple formula fits all the energy levels to within 
2%. The results for n = 0 and n = 1 are slight overestimates, by 2% 
and 1%, respectively, while for n = 2 and above the results are slight 
underestimates, by less than 1%. (For large n one can compare with 
the semi-classical result from Bohr-Sommerfeld quantization.) 


4.5. Quartic Oscillator Example: Discussion 


The key to this success is the “optimal” choice of Q, which is 
different for different n. For any fixed (n-independent) value of Q, 
the approximate result, Eq. (4.6), would give a very poor description 
of the spectrum, so it is crucial that Q is allowed to “run” with n. The 
PMS optimization naturally selects an appropriately n-dependent Q. 

Clearly, the PMS optimization condition (4.7) is reminiscent of 
the Variational Principle. However, only in a few specific instances 
there is a clear-cut connection. For the ground state (n = 0) our 
approximation corresponds to a variational calculation, with the 
ground state of the harmonic oscillator Ho, as the trial wavefunction. 
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In this instance we know, from the variational (Rayleigh—Ritz) theo- 
rem, that the approximate result EG° is greater than or equal to the 
exact eigenvalue Ep for any value of Q. Therefore by minimizing with 
respect to Q, as the PMS criterion does, we obtain an approximation 


to Eo that is unquestionably optimal. 


The same statement can be made about the first-excited-state 
(n = 1) case, using an extension of the variational theorem which 
says that the variational form (|H |} is greater than or equal to 
the exact E) for all states |) that are orthogonal to the exact 
ground state. Our trial state |1) is guaranteed to be orthogonal 
to the unknown exact ground state because the former is odd under 
parity, x + —x, whereas the latter is even. 


For n > 2, however, there is no variational inequality guaran- 
teeing that the approximate result must be greater than the exact 
eigenvalue for all values of Q. Indeed, such an inequality does not 
hold; it is violated in a small region of Q around the Q value. (See 
Fig. 4.3). 

Thus, for n > 2, the PMS choice of 2 is not optimal in the 
rigorous sense of minimizing the error; there are other values of 2 
that would lead to even more accurate results. Indeed, there are 
two “magic” values of Q, either side of the PMS Q, that would 
lead to the exact eigenvalue. Finding those values, however, would 
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Fig. 4.3. Close-ups of the region around the optimal Q, showing the comparison 
with the exact result. For the ground state, n = 0 the variational inequality 
guarantees that the first-order approximant is always greater than the exact 


result. For n = 2 and higher there is no such inequality. Nevertheless, in both 
cases the PMS choice of 2 gives an approximation of comparable accuracy. 
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require some real magic, since that would be equivalent to solving the 
quartic-oscillator problem exactly. When we refer to the PMS choice 
as “optimal” we clearly cannot — except in special circumstances, 
such as the n = 0,1 cases — claim to be using the word in a 
mathematically rigorous way. We mean it in the sense of “the best 
choice, in the absence of further information.” 

An intuitive understanding of why and how Q depends on n 
can be gained from the following rough argument. For a good 
approximation, we want our approximate potential energy, 57 2? 
to be a good approximation to the actual potential energy, Ax*. 
Obviously, this cannot be true for all x, but what we most need, 
when considering the nth energy level, is for it to be true where both 
potential energies are of order of the unperturbed energy EO. (See 
Fig. 4.4.) That is we want 

(n+ EO & 1072? w dat. (4.11) 


2 
From the above approximate relations, we can eliminate x as z? ~ 


5?/X and solve for Q: 


Ox (4(n+2)a)8. (4.12) 


Fig. 4.4. A sketch illustrating an intuitive argument for why Q should increase 
with n: We wish the zeroth-order potential 4073? to be a good approximation 
to the actual potential, Axt for energies of order En. For higher En we will need 
larger Q, corresponding to a steeper quadratic potential. 
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This is in rough accord with Eq. (4.9), having the crucial (n + 4)!/3 
behaviour for large n. 

Note that this is a hand-waving, order-of-magnitude argument; 
the details and the numerical factors are all debatable. (It is 
similar to the argument, in the RS-dependence problem, that the 
renormalization scale u should be chosen to be of order the physical 
scale Q. In both cases the argument is correct; good physics; but 
inherently vague.) Trying to use such “physical” arguments alone to 
fix the value of extraneous parameters is never satisfactory in our 
experience. However, they often provide physical insight into why 
the PMS optimized value is what it is. 


Some authors, faced with a non-invariant approximation, choose 
to fix the extraneous parameter, not by PMS, but by a notion that 
we call “fastest apparent convergence” (FAC). That approach is 
critically discussed in Appendix 4.A. 


4.6. Quartic Oscillator Example: Second Order 


One may proceed to calculate the energy eigenvalues to the next 
order in this perturbation theory. For simplicity, we consider only 
the ground state, n = 0. The general formula for the second-order 
correction to the ground-state energy is 


© (| Hint |O) |? 

=J | ee intl A | (4.13) 
jo Eo -E 

where the sum runs over all the states except j = 0. In the quartic- 


oscillator case, the only non-zero contributions are from j = 2 and 
J=. 


From the a,at algebra one finds that 


ogejo = 2, ato) = EE, 
20 20/2 
(0) (A|x4|0) = M 24 


(20)? 
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and from these results one obtains the two terms, which are 


KOLA 1 fo (v2) , , ( vz)" 
FO pO “Em \ 20 ejj’ 


2 
(O (Hml 1 v (ar) 


E® BD (49) (9) 


Hence, the correction is 


2 2 
CETATE 
a aC 08 Baa) 
This must be added to the result from first order, from Eq. (4.6) 
with n= 0. 


The second-order result is 


res 3 À À ? 
peb Tay (: +8 (È) -14 (È) ) i (4.14) 


As a function of Q this result does not have a stationary point, but it 
does exhibit a flat region around a point of inflexion. See the dashed 
curve in Fig. 4.5. 


Fig. 4.5. Results for Eo to first, second, and third orders in the CK expansion, 
as a function of the extraneous variable Q. 
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In such cases, which do not arise in the RS-dependence problem, 
the application of PMS is not entirely unambiguous. There are 
two reasonable alternative strategies. (a) One may use the point of 
inflexion, where 


d2 Eres 
a = =0. (4.15) 


The argument here is that, while we cannot make the slope, 


dE” /dQ, zero, we can find where the slope is minimized, so that 


> A somewhat 


the result is “least sensitive to small variations.’ 
unsatisfactory aspect of this argument is that the result then depends 
on selecting Q as the extraneous parameter, rather than, say 1/Q, or 
Q3, or some other function of Q. The usual PMS criterion, Eq. (4.7), 
has the nice property of being invariant under such re-definitions, 
but the second-derivative does not share that property. However, 
this concern is minor, in that, for any remotely reasonable choices of 
variable, the result changes only by an amount well within the error 
that one would estimate anyway. (b) The other strategy is to allow 
complex solutions to the usual PMS condition. Close to the point of 
inflexion there will be a complex-conjugate pair of roots to Eq. (4.7). 
Evaluating the result at either of these roots will yield an answer 
with a small imaginary part, which we can then drop. We could view 
this as averaging the results at the two roots, since they will have 
equal and opposite imaginary parts. In doing so we are making use of 
another piece of information we have about the exact result; namely, 
that it is real. 

The results of methods (a) and (b) are given in Table 4.2. They 
agree quite closely, and each produces a gratifying improvement on 
Table 4.2. Optimized results, in first and second order of 
the CK expansion, for the ground-state energy of the quartic 


oscillator, in units of A1/3. The exact value is 0.66798626 .. .. 


Order Q Ene Error 


Ist 1.81712 0.681420 2.0% 
2nd (a) 2.06064 0.668973 0.15% 


2nd (b) 2.02015 + 0.200731 0.668641 + 0.002191i 0.10% 
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the first-order result, reducing the error from about 2% to less than 
0.2%. 

We will return in the next chapter to discuss higher orders in the 
CK expansion. 


4.7. Quartic Oscillator Example: Wavefunctions 


One may also obtain accurate approximate wavefunctions with the 
same approach, as shown by Kauffmann and Perez (KP), whose work 
we follow in this section. The key point is this: Since the wavefunction 
is not a single number, but a function of position, x, the “optimal” 
Q will depend on g. Here, we will consider only the ground-state 
wavefunction, but the story is similar for the excited states. 


The calculation proceeds as follows: The formula for the state 
vector of the nth eigenstate, to first order in perturbation theory, 
is 
=S l, 
jan 
with 


ng — 


 (n| Hint |g) 
EO — EO 
In the case of the ground state, n = 0, the only non-zero terms in 


the sum come from j = 2 and j = 4. From the results in the previ- 
ous section, one can obtain the two non-zero coefficients, which are 


= inl 1 (_ yoo (v2), ( Ov? 
o BO BO ~ any \ 2 \ 20 CAJ)’ 


© (4| Hint |0) 1 ( a 


cua = SS _ = 
EO BO ~ C0) 


(29)? 


The Schrödinger wavefunctions representing the unperturbed 
states |0)®, |2), |4) are 


1/4 
a = (S) tin), 


T 
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po = n - exp (—4027) ee (-1 + 20x”) 
2 T 2 V2 ’ 
Q\4 1 
yo = (=) exp (—3.227) ae (3 — 1202? + 40? r*) . 


The result for the ground-state wavefunction, to first order in 
perturbation theory is then 


0 = o + core” + cosy ®. 


Note that the overall normalization will need to be adjusted at the 
end of the calculation. 


The result of the perturbative calculation is 


res Q me 1 
po = (=) exp (—402”) Te 


x (u + 402? + Z0 — 12Qz? — aat) ' (4.16) 


Optimizing Q, requiring 0y% /ðN = 0 at each x, leads to a quintic 
equation for the optimal Q: 


203 (7 — 40x? — 40724) + A(—99 + 660x? 
+ 36072* + 80%2°) = 0. (4.17) 


Picking the right root of this equation requires a little thought, as 
we now discuss (see Fig. 4.6). 

At x = 0 the unique real root is Q = (Mays — similar to, but 
not the same as, the optimum Q for the ground-state energy, which 
was (6A)!/3. For small but non-zero |2| there are two real, positive 
roots, but one goes to infinity as x — 0, so clearly, for continuity, we 
want the smaller root that starts from (BAI 3 at x = 0. This root 
persists until x ~ 0.65A71/6, where it meets up with the other, larger 
root. 

At large |x| there are also two positive, real roots, but one goes to 
zero proportional to 1/x?: That would produce a non-normalizable 
form for 7, and so can be rejected. The relevant root has Q~ VA |z| 
at large |x| — which means that the resulting wavefunction has the 
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Fig. 4.6. The optimal value of Q as a function of | æ |, in units where \ = 1. 
The relevant root is shown by the solid lines. The segment around | x |+ 0.7 is 
actually the real part of a pair of complex roots. The three other roots, at any 
given |x|, are negative or have negative real parts. These, and the roots shown 
by the dotted lines, may be discarded as clearly inappropriate. 


correct faster-than-Gaussian fall off, with an exp(—4W |x|?) factor. 
By continuity, we can follow this root back to x ~ 0.75A71/6, where 
it meets up with the smaller root. 

There is region, roughly 0.65 < \!/6 | x |< 0.75, where there 
is no real, positive root. For | æ | in this range there is no actual 
stationary point in Q, only a flat region around where 077/00? 
vanishes. As discussed above, we can either use this point of inflexion, 
or — perhaps better — use either of the complex roots (whose real 
part is shown in the figure) and then, at the end, discard the tiny 
imaginary part of the resulting wavefunction. 

Although the resulting Q as a function of |x| has an odd-looking 
zig-zag (see Fig. 4.6), there is no discernible lack of smoothness in 
the resulting wavefunction. This is because w is, of course, very 
insensitive to the precise value of 2, provided that it is close to 
Q. The resulting optimized wavefunction, plotted either on a linear 
or a logarithmic scale, is hardly distinguishable from the exact 
wavefunction. (See figures in KP.) Nevertheless, it would be sensible 
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to use an Q that smooths out the zig-zag region of Q, since we know 
that the exact wavefunction must be smooth. 


4.8. Anharmonic Oscillator and Double-Well Potential 


The more general case of an anharmonic oscillator — a sim- 
ple harmonic oscillator of unperturbed frequency w, with an «4 


perturbation — can be treated very similarly. The Hamiltonian, 
H = ip? + twr? + Ant, (4.18) 
can be written as Hp + Aint with 
Ho =} (P +072”), Him =} (Q? —w?) a? + Azt. (4.19) 


A simple generalization of the previous calculation yields the first- 
order correction term as 
(Q? — w’) 3A 


+ ——(2n? + 2n +1). (4.20) 


1 
1 
EQ) =—- (n+ 43) Q AQ? 


2 
Added to EO), this yields the first-order result 


1 (Y+) 3A 3 
= Ija n —— 
5 (n+ 4) 7) + 72 2" + 2n +1). (4.21) 


Applying the PMS condition dEf**/dQ = 0 leads now to a cubic 
equation: 


res __ 
Ep = 


n + 1)Q (Q? — w?) — 3A(2n? + 2n +1) = 0, (4.22) 
2 


whose positive root is the optimal value, 2. Substituting into Ets 
yields the optimized result. 


Caswell has a nice formalism using a variable 3 defined by 


BX 
=e"? +, 
70 
which allows a unified treatment of all the cases w? positive, zero, 
or negative. (3 = 6, which is the optimal value for the ground state 
at first order, corresponds to the mass renormalization induced by 
normal ordering the Hamiltonian.) 
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For all positive w there is not much to say; the results are even 
better than for the quartic-oscillator case. Indeed, the larger w is, 
relative to \!/3, the better the perturbation theory gets, and the less 
the optimal Q differs from w. Nevertheless, the CK approach still has 
significant benefits over ordinary perturbation theory, particularly 
when one considers high-n energy levels. 

For imaginary w (negative w?) the classical potential has a 
double-well form. For |w | small, relative to \!/°, the CK expansion 
remains good. However, for larger |w | the CK expansion does not 
give satisfactory results — unless we are prepared to go to very 
high orders. There is a simple cure, however. We may generalize 
the method by adding and subtracting a linear term in x: That is, 
one takes Hp to be an oscillator of frequency 2 whose equilibrium is 
shifted to some value x9: 


Ho = 3 (P? + Q? (2 — 29)”), 


2 2 2.9 4 (4.23) 
Him = — 49° (x — z0)” + w s’ + Az". 


There are now two extraneous parameters, Q and 29, that can be 
optimized by the PMS condition. 

If we calculate the ground-state energy Fo(zo, Q) and then 
optimize Q, we obtain Vg(xo), a function of xo that, for sufficiently 
negative w?, has a local maximum at zo = 0 and a pair of minima 


at to = +%o for some non-zero o. The optimized result for the 
res 


ground-state energy E4° is then given by Vg at xo = Zo. 

The function Vg(zo) is called the “Gaussian effective potential.” 
It can be calculated for a wide variety of problems H = 1p? +V (£x) 
with all sorts of classical potentials V(x) and it provides a good 
“picture” of how quantum zero-point-energy effects modify the 
physics — in the present case, for instance, it shows that, for 
w? negative but small, quantum effects wash out the hump in 
the classical potential. Moreover, the concept generalizes naturally 
to QFT, where it has significant advantages over the traditional 
one-loop effective potential. See citations in the references for this 
chapter. 


One may similarly calculate results for the excited states. As n 
increases the optimal xo slowly decreases, until at some sufficiently 
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large n there ceases to be a non-zero Zp and the optimized result 
comes from xo = 0. 


4.9. Conclusions 


Approximations are a gamble; we are forced to act upon incomplete 
information. It is important to make good use of any and all 
information that we do have. Ambiguity is inherent in approximation; 
we always have to decide, somewhat arbitrarily, precisely what to 
calculate, and what the form of the approximant is to be. 

Non-invariant approximations bring in another ambiguity; what 
value(s) should the extraneous parameter(s) take? The PMS crite- 
rion, once we have decided what we are calculating and what the 
form of the approximant is, provides an objective resolution of that 
ambiguity. It uses the information that the exact result is known to 
be exactly independent of the extraneous parameter(s). 

The worst mistake with a non-invariant approximation is to 
assume that the extraneous parameter(s) have to be fixed once-and- 
for-all. In the quartic oscillator example, for example, one can get 
good results for all the energy levels — but only if the value of 2 
is different for the different levels. The PMS criterion automatically 
selects a suitable 2 value for each case. 

We are now ready to tackle the RS-dependence problem by 
viewing it as a non-invariant approximation and applying PMS 
optimization. The impatient reader may wish to skip the next chapter 
and proceed straight to Part II. The topic of the next chapter is high 
orders in examples of non-invariant approximations — the important 
point being that the optimal parameter(s) change from one order to 
the next, which can give rise to “induced convergence.” 


Appendix 4.A: FAC Criteria 


A different approach to dealing with non-invariant approximations 
is a notion dubbed “fastest apparent convergence” (FAC). The 
idea is that, at next-to-leading order, we would obviously like the 
correction term to be small in comparison to the leading term — 
so let us choose the extraneous parameter to make it so. Often, 
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in fact, the correction term can be made to vanish. (Because the 
zeroth- and first-order results then agree exactly with each other, 
this choice is sometimes called a “self-consistency” criterion — but 
that terminology is misleading since there is no logical necessity.) 

The FAC approach shares some of the virtues of the PMS 
method — crucially it allows the extraneous parameter to be different 
in different cases. Sometimes FAC and PMS give quite similar results, 
since the curves for two different orders will often intersect in the 
flat region. However, the curves can be nearly parallel in the flat 
region and bend away together, intersecting only quite far away, 
giving a poor result. In the author’s opinion and experience, FAC 
is unreliable. It is predicated on a property that one would like the 
approximation to have — rapid apparent convergence — whereas 
PMS is based on a property that the exact result does have — 
invariance under variations of the extraneous parameter. 

FAC is certainly inferior in the case of the CK expansion for the 
quartic oscillator. Requiring the correction term EŞ , Eq. (4.5), to 
vanish gives the equation 


1 3A 1. 2 
~ 5 (n+ 3) 2+ Fog (2n +2n+1) =0, (4A.1) 


whose solution is Opac: 


1 
3A (2n? + 2n+1)]? 
Q = | — =] , 4A.2 
pon |S (44.2) 
which is smaller than the PMS Q by a factor of 271/3. The FAC 
result is obtained by substituting this value of Q into Ær, which, by 
construction, is the same as EY) = (n+ 4)Q. Hence, the FAC result 
is 


3A (2n? + 2n + 1) 3 


En \pac = (n a 5) 2 (n es 1) 
2 


(4A.3) 


It is very similar to the PMS result in form and for all n values it is 


1/3 


larger by a factor of 327 x 1.06. Consequently, it is a much less 


accurate approximation. See Table 4.3. 
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Table 4.3. FAC results, to first order in the CK 
expansion, for the energy levels En of the quartic 
oscillator in units of \!/3. Compare with PMS 
results in Table 4.1. 


n BRAG Beck error 

0 0.721 0.667986 8.0% 
1 2.565 2.39364 7.2% 
2 4.958 4.69680 5.6% 
3 7.716 7.33573 5.2% 
10 33.186 31.6595 4.8% 
n —> oœ (1.442) (1.37651) 4.8% 


xina) x(n) 


At higher orders there are different ways to interpret the FAC 
idea: one could require either (i) that the last correction term 
vanishes so that adjacent orders agree, or (ii) that the net correction 
vanishes so that kth order agrees with zeroth order. In some examples 
the first approach is better than the second; in other examples it is 
the reverse. In the CK expansion method (ii) is hopelessly bad, while 
in Exercise 4.2 it is method (i) that is poor. 

In the RS-dependence problem, because there are multiple 
extraneous parameters, it is possible to the choose the RS in (k+1)th 
order so that all the perturbative coefficients rı to rọ vanish. Several 
authors have advocated use of this FAC or “effective charge” (EC) 
scheme. It is certainly useful for formal purposes (see Chapters 7 and 
13), but it is doubtful that formal simplicity is a sound argument that 
it provides the best approximation. At low orders there is often very 
little difference between the FAC/EC and PMS results. However, the 
“induced convergence” scenario (to be discussed in the next chapter), 
in which R(t» tends to a finite limit as k —> oo because a > 0, 
clearly could not work for FAC/EC, where a + 0 would entail R —> 0. 


Exercise 4.1. Consider the spatial integral 


2 
ala 1 
[or 3/2 . faz =, 
ao Vv 4T | R-r | 
arising from the Coulomb interaction of the electron in a hydrogen 
atom with another charged particle at R. The asymptotic result 
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for R + œ is 1/R, and one might expect to calculate corrections 
as a power series in ag/R by expanding the denominator in powers 
of r/R. Show that all such corrections vanish. (Hint: consider the 
generating function for Legendre polynomials.) The power series is 
thus, trivially, convergent — but it does not give the right answer. 
Evaluate the integral exactly and show that there is a correction 
exponentially small in ag/R. 


Exercise 4.2. Consider the problem of evaluating the integral F = 
ida f(x), between specific endpoints A and B, when nothing is 
known about the function f(a) except for the first few terms of its 
two Taylor series, about A and about B. The natural approximants 
are 


€ B 
Flnml(g) = a dx f(x) + f def" (2), 


where fe is the series about A, truncated after the (x — A)” term, 
and similarly for f fn l Clearly, these approximants depend on the 
extraneous variable €, though F itself does not. Show that the PMS 
criterion fixes € to be where the curves from the two endpoints 
cross, if they do. 

Consider the specific example f(x) = sin x with A = 0, B = 7/2. 
Note that the zeroth-order result F!°-°(€) = 4 — £ is monotonic 
in €. Find and plot the diagonal approximants Fl*-*l(€) for k = 0 
to 5. Apply the PMS criterion and compare and contrast with the 
results from a FAC criterion requiring either (i) no change from one 
order to the next, so that adjacent orders agree, or (ii) no change 
from the zeroth-order result. Verify the results in the table below 
and note the following: FAC(i) chooses € to be 0 or 7/2 and thus 
gives very poor results; FAC(ii) typically chooses a € around 0.57, 
on the outskirts of the flat region; the centre of the flat region, the 
PMS choice, yields good results consistently. 


I 2 3 q 5 
PMS 1.07 0.996 0.9982 1.00005 1.00002 
FAC(i) 1.57 1.23 0.925 0.980 1.0045 
FAC(ii) 1.57 0.996 0.9921 1.00015 1.0020 


Consider other choices of f(x). Note that when the function has a 
lot a structure between A and B, the results will inevitably be poor, 
until the Taylor series contain enough terms to begin to describe 
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the structure adequately. Beyond that point you should find the 
PMS criterion giving satisfactory results. 


Exercise 4.3. Consider a quantum particle of unit mass that moves 
in the x1, x2 plane subject to a potential \xz?732. The Hamiltonian is 


SPi + 33 + Arig. 
By adding and subtracting a term ¿Q(x? + 23) use first-order 


perturbation theory and the PMS to find an approximation to the 
energy levels (labelled by n1, n2 quantum numbers). 


Exercise 4.4. Consider the effect of an external electric field 
E on the n = 2 levels of hydrogen, ignoring spin and fine- 
structure effects. Standard first-order degenerate perturbation 
theory predicts a pattern where two levels are unaffected, while 
two other linear combinations of states are pushed up and down 
by an equal energy splitting toR = +3eao| E|, where R is the 
Rydberg constant, ag is the Bohr radius, and e is the electron 
charge. Experimentally, however, the observed splitting is slightly 
asymmetric. That can be predicted by a laborious second-order 
calculation, or by an improved version of the first-order calculation 
where we write the Hamiltonian as Ho(Z) + Hint(Z) with 


2 2 
p Ze 

Ho(Z) = am =È PE 
e2 


Calculate the matrix elements of Hint(Z) between the four unper- 
turbed states and diagonalize. Determine the optimal Z for each 
eigenvalue. Show, using a secondary approximation o < 1, that 
the splittings are o(1 -—o0+---) and —o(1 +0 +---) times R. 
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Chapter 5 


Induced Convergence 


5.1. Three Regions: Diverging, Flat, and Overdamped 


In non-invariant approximations the optimized value of the extrane- 
ous parameter evolves from one order to the next — and that fact 
can be vital to the convergence of the results. For the anharmonic 
oscillator, the CK expansion is divergent for any fixed value of the 
extraneous parameter Q. However, as we shall see, the CK results 
converge if we use the optimized value of Q, which steadily increases 
with order. We call this phenomenon “induced convergence.” 

We begin by considering a very simple example which, while 
not actually illustrating “induced convergence,” is nevertheless quite 
instructive about high orders in non-invariant approximations. Con- 
sider a simple harmonic oscillator of some given frequency w and a 
calculation of its ground-state energy Ep in a CK-like perturbation 
theory: 


1 1 
Ap = ~ T 075), Hin = =5 i w?) a. (5.1) 


That is, we start from an oscillator of a different frequency Q, and 
treat the frequency difference as a perturbation. It is easy to see that 
the resulting perturbation series is the expansion of 


ie 5ovi-2 =e (5.2) 


This is an open access book chapter published by World Scientific Publish- 
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in powers of z, where 
gel=u"/0', (5.3) 


with finite-order approximations corresponding to successive trunca- 
tions of the series 


1 1 1 1 5 
Eo = 5% (1 Zz- 22 — >z’ gs ) : (5.4) 


2^ 8 16 128 


Not surprisingly, the optimal choice of Q in any order is Q = w, 
and it always yields the exact result, Fo = 4w. Nevertheless, it is 
instructive to plot the results, in kth order, as a function of Q; see 
Fig. 5.1. As the order increases, one sees the curves becoming flatter 
and flatter around Q = w, with the first to the kth derivatives all 
vanishing there in kth order. The connection between the flatness and 
the accuracy of the approximation is evident. Note that the results 
diverge for Q < w/v2, reflecting the fact that the series has a radius 
of convergence |z|= 1. 

The figure illustrates the three regions that seem to be character- 
istic of non-invariant approximations. There is a “diverging region” 
at small Q where the results show wild oscillations from one order to 
the next; a “flat region” around the optimal Q; and an “overdamped 
region” at large Q. The extent of the flat region grows with the 
order k. Usually the centre of the flat region also moves, but in 
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Fig. 5.1. Results for Eo, as a function of the extraneous variable Q, in kth order 
of a CK-like expansion for a simple harmonic oscillator of unit frequency (w = 1). 
In every order the optimal Q is 1, as indicated by the arrow. The curves become 
flatter and flatter around Q = 1 as the order is increased. 
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this simple case it remains fixed at Q = w. In this example, the 
results ultimately converge at any Q, provided it exceeds w/V/2. 
However, if we choose an Q that is far too big — in the “overdamped 
region” — the convergence will be painfully slow (just as a strongly 
overdamped oscillator in classical mechanics takes a long time to 
return to equilibrium). Worse still, if we were to choose a too-small 
Q, in the “diverging region” Q < w//2, then our initial bad result 
only becomes worse and worse as we calculate more and more orders! 

In other contexts, the curves in the overdamped region may fall 
towards zero, rather than rising up to infinity, as they head away 
from the flat region. Also, the three regions may appear from right 
to left instead of from left to right; that occurs, for example, if we 
redefine the extraneous parameter from Q to 1/Q. The three regions 
show up in all the examples we study here. 


5.2. High Orders in the CK Expansion for the 
Quartic Oscillator 


The CK expansion is divergent for any fixed Q, just like ordinary 
perturbation theory for the anharmonic oscillator — where it is 
known that kth-order correction term behaves as k!A’k?C(1 + 
O(1/k))\* with calculable numerical coefficients A,B,C. In kth 
order the CK result for Ey can be written as a polynomial in /0°, 
which is the inverse of Caswell’s variable (: 


A A\" 
EM =) (Ae Aga (È) +e + Akk (È) ) . (5.5) 


The first coefficient A; o comes from just the x? term in Hint, so it 
is predictable from the simple example just discussed. The highest 
coefficient Aj, comes only from the Axt term in Hint and is the 
same as in ordinary perturbation theory. Caswell derives a recursion 
relation that allows the A; j; coefficients to be found efficiently. 


This polynomial can be seen as a truncated expansion in an 
effective dimensionless coupling constant 4/03. However, it is not 
the perturbation series itself, since the terms are not successive 
contributions from matrix elements of Hint. Indeed, it is not 
successive truncations of a fixed series: the coefficients Ax; all 
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depend upon k. The series (5.5) displays quite good apparent 
convergence at the optimal Q value. 


For any fixed Q the perturbation series itself diverges, but when Q is 
chosen in each order according to the “minimal sensitivity” criterion — 
so that it gradually increases with order — one finds the successive 
results converging quite nicely. These points are illustrated below. 

Figure 5.2 shows orders up to k = 5. One sees the three regions, 
with the flat region expanding, but also moving out to larger 2. It 
moves faster than it grows. Thus, for any fixed 2 one gets typical 
divergent-asymptotic-series behaviour. For example, with Q/ MB = 
2.1 the initial result is poor, but the results improve steadily for the 
next few orders, appearing to converge to a good answer — and then, 
quite abruptly, there are wilder and wilder oscillations from one order 
to the next; see Fig. 5.3. Choosing 2 a bit smaller would give better 
results at first order, but the divergent behaviour would set in almost 
immediately. (Choosing Q too small puts one in the diverging region 
right away.) Choosing 2 to be larger one stays in the overdamped 
region for a longer time, with worse results at low orders and a more 
gradual approach towards settling down — though now to a more 
precise value — before the divergent behaviour finally sets in. 


0.685 
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Fig. 5.2. Results for Eo to kth order in the CK expansion, as a function of the 
extraneous variable Q, in units of \!/?. The flat region grows, but it moves faster 
than it grows. For any fixed 2 the results diverge, but if we follow the flat region, 
finding the PMS optimal Q in each order, the results converge nicely. 
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Fig. 5.3. An illustration of the asymptotic-series behaviour at fixed Q. Results 
for Eo at kth order for k = 1 to 8 at Q = 2.1. 


With PMS optimization we ensure that a suitable Q is used at 
each order, starting with Q/A! = 6! = 1.82 at k = 1 and steadily 
increasing with k, so that we “surf” along with the flat region as it 
moves outwards. In fact, we want to “surf the scary edge” of the flat 
region, close to the boundary with the diverging region. This point is 
illustrated in Fig. 5.4, which shows results for 23rd order. Here, when 
we examine the flat region in detail, we find three stationary points. 
All give a good approximation, but by far the best approximation 
comes from the one closest to the diverging region — which is also 
the “flattest” in that it has the smallest second derivative. As k is 
increased, each individual stationary point moves off to the right 
and new stationary points are born at the “scary edge.” When being 
born these new stationary points may appear in embryo as points of 
inflexion. 

For the proof of convergence we refer the reader to references in 
the bibliography. The proof applies also to a ¢* field theory in 1 +1 
dimensions and shows that the convergence is exponentially fast. 

The shrinkage of the effective coupling A/Q? with increasing order 
is readily understandable intuitively through an extension of the 
argument in Sec. 4.5 and Fig. 4.4. Quantum mechanical perturbation 
theory for En at kth order involves excursions to various intermediate 
states |n;), producing a product of matrix elements 


(n| Hing |M2)(n2| -+ |M&)(r« | Hint |”); (5.6) 
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4 


Fig. 5.4. Results for Eo at 23rd order in the CK expansion, as a function of Q: 
(a) shows all three regions; diverging, flat, and overdamped, (b) shows the flat 
region on a fine scale, and (c) shows a very fine-scale close up of the front of the 
flat region. The vertical ranges are, respectively, (a) 0.667 to 0.670, (b) 0.66796 
to 0.668025, (c) 0.6679861 to 0.6679865. The exact result, shown as a dotted line, 
is 0.66798626. Of the three stationary points in the flat region the one closest to 
the onset of the divergent region is the most accurate. 


each with its corresponding energy denominator En — Ew. In 
very high orders most of these matrix elements involve high-lying 
eigenstates — and for such states, if the unperturbed states and 
energies are to mimic the actual ones, the appropriate Q is large. 
Thus, we can expect the optimal 2 to steadily increase with order. 

The same intuitive argument applies to the QFT case (provided 
we consider only perturbatively calculable, infrared-safe quantities). 
If we were to use “old-fashioned perturbation theory” then, again, 
the high orders would involve the physics of short-lived, high-energy 
intermediate states. In covariant Feynman perturbation theory 
the corresponding statement is that the high orders will involve 
highly virtual particles. In QCD, of course, the effective couplant 
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shrinks only logarithmically with energy scale, rather than the \/° 
behaviour of the oscillator problem. 


5.3. A Toy Model Framework 


For the remainder of this chapter we consider a toy model. It is a bit 
artificial, but has the virtue that the induced convergence property 
can be proved relatively easily. 

Given a series 


R = ao(1 + r1(0) ao + ra(0) a2 +--+), (5.7) 
let us make the substitution 
ao 
= = 5.8 
a=a(r) EE (5.8) 


where T is some real-valued parameter, and consider the resulting 
re-expansions 


R =a(l +ria +rza? +---). (5.9) 


The coefficients are easily determined to be 


rj =rj(r)= 2 jds (5.10) 


where 


(') E Ta (5.11) 
(j — i) 


are the binomial coefficients. This simple mathematical system 
mimics, to some extent, the RS dependence problem. Both the 
expansion parameter a and the coefficients r; in Eq. (5.9) depend on 
the extraneous variable 7, which plays the role of the RS. Obviously, 
R itself does not depend on 7, and the 7 dependences of a and the 
r; must cancel in Eq. (5.9). However, this cancellation is spoiled if 
the series is truncated. 

For the analogy with RS dependence to hold good it is important 
that all the expansion parameters should be on an equal footing. 
Equation (5.8) is the simplest example of a substitution which 
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achieves this. The symmetry between any two expansion parameters 
is made manifest by the relation 


1 1 , 
—-——=T-T. 5.12 
rae noes (5.12) 
In contrast, a substitution such as a = ag(1 + Tao) would not 


have been suitable, since it would have given ag a unique status. 
Equation (5.8) is particularly appropriate since it directly recalls 
the (leading-order) QCD running coupling constant formula, if we 
identify 7 as proportional to the logarithm of the renormalization 
scale. Indeed, one has the “6-function equation” 


“ =i (5.13) 
The analogy with renormalized perturbation theory is not per- 
fect. Here the multiplicity of expansion parameters, the “extraneous” 
variable, the “8 function,” and the resulting ambiguity of finite- 
order results have all been introduced artificially. Moreover, only a 
single extraneous variable has been introduced here, whereas the real 
problem involves (n — 1) RS parameters at nth order. Nevertheless, 
this oversimplified model can be very instructive. 
Differentiating Eq. (5.9) and using Eq. (5.13) leads to 


dR ’ 
r (—a°)(1 + 2ra + 3r2a? +--+) + (Fa? + r203 + F304 + ---). 
(5.14) 
The cancellations required for this to vanish yield 
drj . 
= T = JT5-1) (5.15) 


which determine the 7 dependence of the coefficients. Integration of 
these equations leads, of course, to Eq. (5.10), previously obtained 
by direct substitution. 

The nth-order approximant, defined as the truncated series 


RY jones růrn10!), (5.16) 


is now a known, well-defined function of T, as soon as we know the 
values of a and rj,...,7%n—1 at some fixed value of 7, such as T = 0. 
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It is not often necessary to have an explicit formula for R (r), but it 
is sometimes helpful. One may write R((r) as the rational function 


n 


R (7) = aa Sr ag? (" ) RI (0). (5.17) 
j=l 


This formula is most easily proved by induction: a direct proof 
requires a remarkable identity between binomial coefficients. 

The derivative of R”) is the residuum of the cancellations in 
Eq. (5.14): 


(n) 
“ = -nry aa. (5.18) 
T 


The “optimum” value of 7 at nth order, according to the PMS 
criterion, is where the right-hand side vanishes, which leads to the 
“optimization condition” 


fois =F) = 0. (5.19) 


(In this instance, the PMS condition is equivalent to a FAC-type 
condition that the last calculated coefficient is made to vanish.) After 
(n) 
Oo 


solving this equation for 7,, the “optimized” approximant Rept can 


be found by evaluating R™ (T) at the point T = Tp. 


If the PMS condition, Eq. (5.19), has no real solution — a 
situation that will arise in odd orders in the following example — 
one has two reasonable choices, as discussed in Sec. 4.6. One can 
consider solutions of R /dr? = 0, or one can use a complex 
solution of Eq. (5.19), finally discarding the small imaginary part 
of the resulting R). In the real RS-optimization problem, with 
multiple RS variables, one always finds a solution to the usual PMS 
condition, so we need not dwell on this issue. 


Clearly, one can investigate many specific examples within this 
framework, simply by beginning with a different initial series in 
Eq. (5.7). It is a simple exercise to show that if the initial series 
is geometric, then the optimized result in second order, and all 
subsequent orders, is exact. This is true irrespective of whether the 
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initial series was within its radius of convergence or not. In the next 


section we consider a more challenging example in detail. 


5.4. Alternating Factorial Series 


Consider the alternating factorial series 
R = ao(1 — lap + 2!a2 — 3!a8 + ---), (5.20) 


previously discussed after Eq. (4.2). In the transformed series, 
Eq. (5.9), the coefficients are given by 


j i 
ry =(-1P 710 ears (—1) #!T;[e77], (5.21) 


i=0 


where the notation T,,[F(a)] means “truncate the series for F(a) = 
Fo + Fia + --- immediately after the a” term” (i.e., T,[F'(a)] = 
Fo + Fia +--- + F,a”.). If r is fixed then the coefficients will 
eventually behave as (—1)/j!e~7 for sufficiently large j, with e~7 
being a constant. Therefore, the series remains divergent for all fixed 
values of 7. 

However, if we “optimize” the choice of T at each order then the 
resulting sequence of approximations is convergent. The proof will be 
given in the next section. First, we discuss the optimization procedure 
in more detail and present a numerical example. 

In this case the optimization condition in nth order, Eq. (5.19), 
becomes simply 


Tile ™] = 0. (5.22) 


For even values of n this equation has a single real root, which grows 
approximately linearly with n (see Eq. (5.26) below). 


For odd n there is no real root. As mentioned earlier, one may 


either use a complex root, or seek to minimize the slope, so that 
d?R™ /dr? = 0, which leads to 


(ag + 7) Tn-2le7™] + (n + 1)Tp-ale-**] = 0. 
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This equation has two real roots, but the larger corresponds to a 
maximum of the slope, and hence is not relevant. The appropriate 
root lies just beyond the 7 of the previous even order. 


Having solved for 7, one can immediately evaluate the optimized 
values of a, rj and R® from Eqs. (5.8), (5.16), (5.21). (Equivalently, 
one could substitute in Eq. (5.17).) Some numerical results for the 
case ag = 0.25 are shown in Table 5.1 and Fig. 5.5. 

The naive results, corresponding to the partial sums of Eq. (5.20), 
show some initial apparent convergence, but soon develop violent 
fluctuations. The optimized results, by contrast, converge in a steady 
fashion. This comes about because 7, grows with n at just the right 
rate. The resulting decrease in the effective expansion parameter än 
counterbalances the potential growth of the coefficients. The radius 
of the original series is zero, but asymptotically one is expanding in 
powers of a vanishing parameter. Note that it is convergence of a 
sequence — the sequence of optimized approximants — that really 
matters, not the convergence of any series. 


Table 5.1. Approximations to the alternating 
factorial series, for a9 = 0.25. The Borel sum 
has the value 0.20634565. 


Order Naive T a Optimum 
1 0.250 = -= -= 

2 0.188 1.00000 0.200 0.2000000 
3 0.219 1.43845 0.184 0.2061121 
4 0.195 1.59607 0.179 0.2054645 
5 0.219 2.00000 0.167 0.2062757 
6 0.189 2.18061 0.162 0.2061192 
T 0.233 2.59875 0.152 0.2063073 
8 0.156 2.75900 0.148 0.2062634 
9 0.310 3.19736 0.139 0.2063235 
10 —0.036 3.33355 0.136 0.2063083 
11 0.829 3.79230 0.128 0.2063320 
12 —1.550 3.90545 0.126 0.2063259 
13 5.588 4.38352 0.119  0.2063367 
14 —17.610 4.47541 0.118  0.2063340 


15 63.581 4.97148 0.111  0.2063394 
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Fig. 5.5. Optimized results for the alternating factorial series example. Even 
orders shown by dots, odd orders by squares. The Borel sum is indicated by the 
dashed line. 


Having said that, it is nevertheless interesting to see how the 
terms of the (truncated) series behave numerically at the “optimized” 
value of T. A typical example is 8th order, which looks like 


RC) (T) = 0.148(1 + 0.260 + 0.090 + 0.028 + 0.011 
+ 0.003 + 0.002 + 0). (5.23) 


Thus, the “apparent convergence” of the series in the “optimized” 
scheme is perfectly satisfactory. The residual error one might rea- 
sonably guess from this, say, about 0.148 x 0.001, is a reasonable 
indicator of the actual error (which is, in fact, about half this size). 
Note that our estimate here used the trend of the last few terms, not 
just the last term alone — because it, of course, was made to vanish 
exactly by the optimization condition. (One may also make an error 
estimate based on the sequence of optimized results in Table 5.1, 
with reasonable results.) 

It is also instructive to plot some graphs of R® (T) against 7T 
for various values of n. This has been done in Fig. 5.6, by utilizing 
Eq. (5.17). As before, one can describe each curve in terms of three 
main regions: (i) a “diverging region” at small r where the function 
tends rapidly to +00 or —oo, depending on whether n is odd or even, 
(ii) a “flat region” around 7 = Mn, and (iii) an “overdamped region” at 
large T in which R™ (r) is small. The fuzzy boundaries between the 
regions move steadily to the right as n increases. They do so at slightly 
different rates, so that the width of the flat region slowly grows. 
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Fig. 5.6. Approximants R(T), n = 10, 11, 12, 14, for the alternating factorial 
series example, shown as functions of the extraneous variable T. 


If we sit at some fixed, reasonably large T, we see each of these 
regions in turn, as n increases. Low-order results are too small, but 
they grow steadily and begin to settle down to an almost constant 
value. Then, quite suddenly, we are overtaken by the diverging region, 
and the results start to oscillate increasingly violently from one order 
to the next. The view from the “comoving frame” of the optimized 7 
is quite different. One sees the approximation converging smoothly, 
and becoming flat over an increasing range of r. From this vantage 
point the approximation is not only becoming steadily more accurate, 
it is also successfully mimicking the t-independence property of the 
exact result. 


5.5. Proof of Convergence 


This section outlines the proof that the sequence of optimized 
approximants does indeed converge. One first needs to know the 
behaviour of 7, as n — oo. For this purpose one needs the integral 
representation 


le 7] = e77 (1 + oo l “at rte) , (5.24) 


obtained by repeated integration by parts. In the case that n is even, 
the optimization condition T,_1|e~™] = 0 reduces to 


f ater = T(n). (5.25) 
0 
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One can now consider this as an abstract problem with n ranging 
over all real values. One can take the difference of this equation and 
the corresponding equation with n > n+ 1 and then put bounds on 
the remaining integral term. Then, using the asymptotic behaviour 
of the T function, one finds that 


Tn ~xn+ €lnn+O(1), (5.26) 
where xy œ% 0.278 is the solution to 
Iny+yv+1=0, (5.27) 


and € = iy/(1+ x) æ% 0.109. For odd n an analysis of the 
dR) /dr? = 0 equation leads to the same result, except that the 
coefficient of the Inn term in Eq. (5.26) is altered. 

The alternating factorial series, Eq. (5.20), is a classic example 
of a Borel-summable divergent series. The sum of the series, in the 
sense of Borel, is given by the integral 


Ras fo du e~v/2 — (5.28) 


l+u 
Heuristically, the series is generated by expanding 1/(1 + u) as 1 — 
u+u?—u>+--- and then integrating term by term. Of course, 
the expansion is only valid for |u| < 1 whereas the integral involves 
0 < u < œ. The difficulty is exposed by using the formula for the 
sum of a truncated geometric series: 


Tni = = os (5.29) 


Thus, truncations of Eq. (5.20) correspond to 


= fo du e—%/2 Pa , (5.30) 


in which the (—u)” term is far from negligible as n — ov, and is 
the cause of the wild oscillations. Recalling that 1/aọ = 1/a — 7, 
one can write down a similar integral representation of the nth-order 
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approximant at a general value of T: 


oo 1 
RM (7) = f due“? Tp lew |: 5.31 
(T) es IE (5.31) 


The following algebraic manipulations: 
1 1 UT 1 
T PN =T. |—— ba ae 
nı [e — m ll + Prel 
(ur)? 1 1— (-—u)” 
To = | ———— 
(n—1)! l+u l+u 


ee a 


n 
1 UT (ur)r—1 
a eT 1 et ee Sy 


i: T! m=) IFu 
—T —T n-1 
~ al - u (Tn-i[e""] — (—u)"Tp_rle7]) , (5.32) 


then lead to 


R™(r)= | d 
(T) o Mire 


(Tn-1[e"7] — (—u)"Tr-ife-7]). (5.33) 


The optimization condition chooses 7 such that T,_1[e~7] = 0, and 
so it eliminates the troublesome (—u)" term. (Let us restrict the 
discussion to even values of n for the moment.) If T,1[e"7] could be 
replaced by e“™ as n — oo, then one would have established that the 
sequence of optimized approximants converges to Rp. 

However, that replacement is not valid when 7 is mn, which 
is linearly increasing with n. Each term in the difference (e“™ — 
Tn—1[e"™]) generates only an O(1/n) correction, but the sum of those 
corrections is finite. To show that fact one can use the representation 
equation (5.24) to rewrite Eq. (5.33), for T = 7n, as 


e—/a0 y(n, UTn) 
0 1+u T(n) 


(5.34) 
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where 7(n, z) is the incomplete [ function 


v(n,2)= f duu" te, (5.35) 
0 


and ['(n) is, of course, y(n, 0o). One requires the large-n behaviour 
of y(n, z(n)) when z behaves as uyn + O(Inn). Applying Laplace’s 
method, one finds the result (see references in the Bibliography) 


a (2% sw) D n ux <1, eae 


1, ux>t. 


Laplace’s method can also be used to show that the result is 
unchanged when the ¿lnn term in 7,, Eq. (5.26), is included. (This 
indicates, incidentally, that the flat region grows at least as fast 
as Inn.) From the above equation, we see that the second term in 
Eq. (5.34) receives contributions only for u > 1/y ~ 3.591, and its 
integrand has exactly the same form as that of Rp. Thus, the limit 
of the sequence of optimized approximants is not the same as the 
Borel sum: Instead, 


has 1/x a 1 


Ma Rips 6 1l+y 


(5.37) 
The difference from Rpg lies only in the upper limit of the integral — 
something that does not affect the series expansion. The difference 
is exponentially small, O(age7!/ (xa0)) for small ag. In the numerical 
example of Table 5.1 with aj = 0.25, the difference is 3 x 1078. (This 
difference is far too small to be visible in the figures earlier.) 


The above proof was actually restricted to the case n even. When 
n is odd we also need to consider the T,,-1[e~7] term in Eq. (5.33). 
One can show, by an extension of Laplace’s technique, that this 
term gives a contribution that is suppressed by a 1/n factor, so 
that odd orders converge to the same limit as even orders. (The 
1/n factor was unfortunately mislaid in Ref. [10], which erroneously 
stated that odd orders converged to a different limit.) 


We caution that one should not necessarily assume that the Borel 
sum is “right” and the optimized limit is slightly “wrong” — nor vice 
versa. As pointed out in Sec. 4.1 that issue can only be meaningfully 
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addressed given a specific context in which the original divergent 
series arose. One of the most unsatisfactory features of our “toy 
model” framework is just that it has no such context, since we began 
with “Given a series ...”. Also, here one expansion parameter is 
actually a bit special, because the expansion in ag has a particularly 
simple and regular form, compared to general a — whereas in the 
real problem all a’s are fundamentally on an equal footing, with the 
counterpart to ag being the ill-defined bare coupling constant. 


5.6. Conclusions 


This chapter has only scratched the surface of a large subject that 
raises difficult mathematical challenges. The conditions under which 
induced convergence will occur, and whether it works in the context 
of OPT applied to QCD remain unanswered questions. 


Exercise 5.1. Prove the claim at end of Sec. 5.3 that if the initial 
series is a geometric series, then the optimized result in second 
order, and all subsequent orders, is exact — irrespective of whether 
the initial series is within its radius of convergence or not. 


Exercise 5.2. Prove Eq. (5.17) by induction. 
Exercise 5.3. Prove Eq. (5.26) for the high-n behaviour of 7n. 


Exercise 5.4. Find how Fig. 5.6 would appear for very large n, if 
R) is plotted as a function of t = r/n. By considering the second 
term of Eq. (5.33), show that the boundary between the divergent 
and flat regions is at t = x, so that the optimized result “surfs the 
scary edge.” 
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Optimized Perturbation Theory 
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Chapter 6 


Preliminaries: RG Invariance, 
int-G Equation, A Definition, 
and CG Relation 


6.1. Physical Quantities and Their RG Invariance 


QFT involves many theoretical objects — Green’s functions, for 
instance — that are not physically measurable and are not RG 
invariant. While these are very important, they are not directly 
relevant to our concern here. Ultimately, it is only the theory’s 
predictions for physical, experimentally measurable quantities that 
will matter. For our purposes then, Green’s functions, etc., are merely 
intermediate steps in calculating physical quantities and we shall 
focus our discussion on the physical quantities themselves. 

Some physical quantities, such as hadron masses, are inherently 
inaccessible to perturbation theory. Others have a factorized form 
where one factor is non-perturbative; such quantities will be dis- 
cussed in Chapter 12. (Other physical quantities, such as the QCD 
pressure at high temperature, have a still more complex structure.) 
Here we focus on quantities with a normal perturbation series. These 
have the form A, + Ao with a leading-order coefficient A, and, 
sometimes, a zeroth-order term Ap. The coefficients Ag and A, 
(which carry the dimensions of mass to the appropriate power) are 
RS invariant, so we may focus on dimensionless physical quantities 
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R of the form: 
R=a¥(l+rja+roa*%+---), (6.1) 


where a is the renormalized couplant gł /(4r?). The power P is 
typically 1 or 2 or 3, but may be left general. At the level of formal 
power series there would be no loss of generality in taking P = 1, since 
one could always replace R with another physical quantity R!/?. 
However, that trick would not be natural when œR is to be truncated 
at finite order. In some places, to simplify the presentation, we will 
first discuss the case P = 1, before generalizing to any P. 

Often R is not a single quantity but is a function of several 
experimentally defined parameters. We may always single out one 
parameter with the dimensions of mass that we will call the “physical 
energy scale” and denote by “Q.” All the other experimental 
parameters can be taken to be dimensionless: if they were not 
originally we may scale them with an appropriate power of Q to make 
them so. We will need “Q” only in order to discuss which quantities 
are, and which are not, Q dependent. Thus, we may leave the specific 
definition of Q in any particular case to the reader’s choice, without 
creating any ambiguity. 

The dependence of R on Q is of great interest. When the original 
Lagrangian of the theory contains no parameter with dimensions 
of mass, but only a dimensionless bare coupling constant, gp, then 
Dimensional Analysis would seem to imply that the dimensionless R 
must be independent of Q. As discussed in Chapter 2, that conclusion 
is false because of the need for renormalization — which necessarily 
introduces a renormalization scale, u. A calculation of the coefficient 
rı reveals that it depends logarithmically on the ratio of Q to u 
(and r2 involves a log-squared, etc.). Although the u dependence 
ultimately cancels in R, the Q dependence survives. Specification of 
a boundary condition for the $-function equation (see Sec. 6.3 below) 
introduces a mass parameter A, and R is ultimately a function of the 
ratio Q/ A. The phenomenon that a theory, seemingly involving only 
a dimensionless constant gg, ends up specified by a characteristic- 
scale parameter A is known as “Dimensional Transmutation.” 

The key property of a physical quantity is that of “Renormaliza- 
tion Group” (RG) invariance, which means that a physical quantity 
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is independent of the renormalization scheme (RS); that is, of the 
precise definition of the renormalized couplant, a. Heuristically, this 
property might be expressed symbolically as 


dR OR da OR 


(RS) — a(RS)|, + NRI) Oa’ ee) 


where the total RS dependence is separated into two sources: (i) RS 
dependence from the series coefficients, r; (differentiation pretending 
that a is constant), and (ii) RS dependence from the couplant, a. 


A note on notation: The partial derivative 2R is the derivative 
regarding the coefficients r; as constant, and so is just the ordinary 
derivative of R’s power series. There is a reason we do not write it 


as dR/da; see Sec. 8.3. 


A particular case of Eq. (6.2) is the familiar RG equation 
expressing the renormalization-scale independence of R: 


o o 
(u tog )R=o (6.3) 
which involves the famous “8 function,” defined as 
da 
= u—. A 
B(a) = T (6.4) 


It is important to stress that there is not a unique 8 function 
characteristic of a given theory. The 6 function, like a, is an RS- 
dependent object. We shall write the 8 function, in some arbitrary 
RS, in the form 


B(a) = —ba? (1 + ca + cpa? 4+---). (6.5) 


As shown in the next section, the coefficients b and c are invariants, 
but all the higher coefficients c2, c3,... depend on RS. 


Note that an explicit minus sign is included in front of the 
expansion of 6 in Eq. (6.5) so that b is a positive number in 
the most interesting (“asymptotically free”) QFTs. However, all 
our discussion applies equally to theories with negative b, with 
the understanding that then the infrared and ultraviolet limits 
exchange roles. (For the special case of theories with b = 0 see 
Exercise 6.1.) The reason for factoring out b in 6(a)’s expansion 
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is that it is then absent, or scales out of, almost all subsequent 
equations. 


What exactly is the “renormalization scale” u? In defining a 
particular RS many arbitrary choices are made that, explicitly or 
implicitly, introduce many arbitrary parameters. One of these param- 
eters necessarily has the dimensions of mass and may be singled out 
as “u;” any other parameters involved can be taken as dimensionless 
since otherwise we could always scale them with the appropriate 
power of u. It is sometimes helpful to use the terminology that a 
“renormalization prescription” (RP) is a set of renormalization con- 
ventions that fixes all the RS choices, including defining the “mean- 
ing” of u, but does not fix the value of u. (The MS scheme is really 
a “prescription” in this terminology.) Often “RP” and “RS” can be 
used interchangeably, but sometimes the distinction is useful to make. 

The precise definition of u in any specific RP, and the particular 
choice of Q for any specific physical quantity, may be left to the whim 
of the reader. The results of optimized perturbation theory will be 
the same, in content, whatever choices are made. 


6.2. RP Invariance of b and c 


A key fact, first appreciated by ’t Hooft, is that the first two 
coefficients, b and c, of the 8 function are RP invariant, while the 
higher coefficients are not. The proof is straightforward: Two RP’s 
(that is, two schemes with the same value of u) are related by a 
general transformation 


a’ =a(1+uja+vga*+---), (6.6) 

where the v; coefficients cannot depend on u, for dimensional reasons. 
In the primed RP the 8 function: 
— dd 
=p a 
will not be the same as the original 6 function. The two are related 
by 


B'(a’) = b'a? (1 +da +a? +- ) (6.7) 


da! da’ da da’ 


b'(a) = Ppa = do da = Prak (6.8) 
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With da’ /da evaluated from Eq. (6.6) this gives 
B'(a’) = (1 + 2uya + 3020? +--+) B(a). (6.9) 
Expanding the right-hand side then leads to 


(1 + 2uja + 302a? + ---)(—ba?)(1 + ca + cpa? + ---) 


= ba?(1 + (2v + c)a4 (3v2 + 2uje+c2)a7+---), 
(6.10) 


while eliminating a’ in favour of a in the left-hand side gives 
=f (a(1 + ma+ mo? +j 14 eo(l + na+--)+ Ga? +>) 
= Va? (1 + (2v1 + c'a + (2v2 + v7 + 3v1 + chja? +--+). 
(6.11) 
Equating coefficients of a? and a? shows that 
b =b and =G (6.12) 


Note that the higher coefficients are not invariant; for instance c} = 
co + vg — vi — vic, as one can see from equating coefficients of at 
above. 


In gauge theories some RP’s are said to be “gauge dependent.” 
This just means that the RP is not fully defined until the gauge 
choice is specified. As mentioned in Appendix 2.C, in such RP’s the 
B function should be defined with the “0a/Op derivative taken at 
constant renormalized gauge parameter, €, not at constant bare ég. 
With the other definition the second coefficient of the 8 function 
would not be invariant because the renormalized € would then be a 
source of implicit ju dependence in the v; coefficients. Note also that 
R 


there is no need for a ES) g term in the RG equation, Eq. (6.2) 


because R = 0, since physical quantities are gauge invariant. 


2 


In any specific renormalizable QFT the invariants b,c can be 
calculated and we regard them here as givens. For instance, in 
Quantum Chromodynamics (QCD) with ny flavours of massless 
quarks one has 


_33-2ny = 153 — 19g 


b easel 
6° © 9@3— 207) 


(6.13) 
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6.3. Integrated 3-Function Equation; Definition of A 


The 6-function equation 


da 
—= 14 
we = Bla), (6.14) 
integrates immediately to 
da 
lnu = ls + const. 6.15 
BO = 


However, specifying the constant of integration is a slightly tricky 
task and requires a careful discussion. The first step is to write 
the constant of integration as ln À + C, where A has dimensions of 
mass, so that we can convert the In yw to a logarithm with a properly 
dimensionless argument. Next, since (a) is known only as a small-a 
expansion, we naturally want the range of integration to be 0 to a. 
That gives us 


~ dx 
In(u/A) = — +C, (6.16) 

o (z) 
but the integral is then divergent — so the constant C must also 
be divergent in a compensating fashion. A precise specification of 
C amounts to a definition of the A parameter. (That will define A 
within the specific RP we are using. Later on we will need to ask if 
A depends on the RP choice.) One convenient choice of C is to write: 


m(4/Ã) = lim (f° a + c(6)) l (6.17) 
with 
H= im ica (6.18) 


Note that C(ô) involves only b and c and so is RS invariant. 


The tilde in “A” is included to distinguish it from another 
widely-adopted definition of the A parameter in the literature. The 
two definitions are related by an RS-invariant factor: In(A/A) = 
(c/b) In|2c/b|. See the discussion in section 6.4 below. 
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It is convenient to introduce the variable 


T =bln(u/A) (6.19) 
and to define 
_ B(x) 2 
B(x) = pa =1+cr +e +.. (6.20) 


Equation (6.17) then becomes, after splitting the C(ô) integral into 
two pieces, 


= [ dx P f dz j: L dx 
ae: 5 2? B(x) Js (+err) Ja 2A +c) 
(6.21) 


This equation gives 7 as a function of a, which we shall name K (a): 


r=k@= [ate f e. 
(6.22) 


Note that the second term is now a well-defined integral, 
convergent at the lower limit x — 0. The first term gives the second- 
order approximation to K(a) and is easily evaluated using partial 
fractions: 


= l 2 & 
= > = 6.23 
/ a (= c5) a 


ca 
= — + cln : 
| 1+ca 
To summarize: the integrated 6-function equation — which we 
will henceforth refer to as the “int-G equation” — is given by 
= Kia), (6.24) 
with 


— A(a), (6.25) 
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where 


A(a) = [5 (a5 = ss) (6.26) 


The above form is quite satisfactory when c is positive (which 
is the case for “real-world QCD,” where nf < 6). However, if c is 
negative, which occurs in hypothetical theories with a large number 
of fermions, there is a slight problem with a pole at x = —1/c. In 
that case, we need to specify the Cauchy principal value for the C(ô) 
integral in Eq. (6.18). The easiest way around the problem is to 
rewrite A (a) in the form 


K(a) = L + cln |ca| — (a), (6.27) 


where 


A(a) = [5 — =1+ cr) l (6.28) 


(For further discussion of this integral, see Appendix 9.A.) The 
equivalence between the two forms of K(a) can easily be checked 
by verifying that 


Ata) - aa) = [S$ (-14e+ 4) 


a fo 
= i de — (6.29) 
0 1+cx 
= cln |1 + cal. (6.30) 


Thus, Eqs. (6.25) and (6.27) correspond to the same definition of A. 


6.4. Inverting the int-3 Equation 


Ideally, one would like to invert the int-G equation to express the 
couplant a as a function of 7. Alas, that step cannot be carried out 
analytically. One might attempt to use an analytic approximation, 
such as an expansion in inverse powers of 7, but that procedure 
has serious drawbacks. For one thing the expansion is ugly, involving 
logarithms of 7. Also, it applies only when py >> A and thus precludes 
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any attempt to go to low energies. Most seriously, the errors it 
introduces, even at high energies, are hard to estimate and control. 
The routine matter of inverting a transcendental equation should 
not be allowed to contribute significantly to the error in the final 
prediction. Instead, one should respect the r = K(a) equation, 
solving it numerically whenever necessary, taking care that the 
numerical solution is accurate to several more decimal places than 
the minimum required for the final prediction to stay within its 
associated error estimate. 

There is another important reason not to use a series in powers of 
1/In(/A): truncating such a series introduces a spurious dependence 
on exactly how one chooses to define A. The definition of A made 
earlier corresponds to the particular choice for the constant of 
integration C(ô) in Eq. (6.18). That choice is quite elegant and 
convenient, but it is not unique and is just a convention. One could 
well have chosen some other C(6), differing by a finite, RS-invariant 
number. Indeed, the historical definition of A (without the tilde) is an 
example of a different choice. The existence of different conventions 
for how to define the A or A parameters creates no problem, since one 
can convert exactly between the different definitions. Using truncated 
series in 1/In(j/A) or 1/In(u/A) would spoil that exactness and 
introduce a new source of ambiguity. That ambiguity is wholly 
avoidable, and it is sensible to avoid it. 

The original definition of the A parameter, by Buras et al., 

was such that the series expansion of the couplant a as a series in 

1/In(u?/A?) had the form (in the original notation) 
pe l (Bann u?/A2) +E] 
Bo In(u?/A?) 3 In*(u?/A?) 
with the constant C chosen to be zero. Converting to our notation 


a = g?°/ (47°), b = 8r? 6o, c = 4r? 81/60, one can check that this 
corresponds to an approximate inversion of 


—+cln 
a 


b 
z + O(a) =u, 
instead of our form 


K(a) = L + cln |ca| + O(a) = bln(u/A), 


114 Renormalized Perturbation Theory and Its Optimization 


which follows from Eqs. (6.25) or (6.27), because A(a) or A(a) 
are O(a). Comparing the above equations shows that In(A/A) = 
(c/b) In |2c/b|, as stated earlier. 


6.5. RP Dependence of A: The CG Relation 


Having defined A we should now ask; is it RP dependent? The 
answer is “yes,” but in a simple and definite way, described by the 
Celmaster—Gonsalves (CG) relation. 


Theorem (Celmaster and Gonsalves). If two prescriptions (two 
schemes with the same value of u) are related by 


a’ =a(l+va+-::), (6.31) 
then 
In(A’/A) = v/b. (6.32) 
This result is exact and does not involve the v2, v3,... coefficients. 


Proof. We start from Eq. (6.17) and the corresponding equation in a 
different, primed, RS. Subtracting those two equations, with py’ = p, 


yields 
Cae dgr a" da! 


Evaluating for small a leads to 


in(A'/A) = G 7 =) +O(a)=0/b+O(a). (6.34) 
Now, since the left-hand side is independent of u, we can choose to 
evaluate the right-hand side at any value of u. Choosing u — oo (if b 
is positive), or u — 0 (if b is negative), makes a + 0 and so completes 
the proof. 

This proof, though it is sound, can seem too slick. The following 
alternative proof (due to Osborn) is instructive. First, one splits each 
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of the integrals in Eq. (6.33) into two pieces: 


dz dg £ 
N/A f E | S 
MEE ms (fg Be) Is POD Je FR 
- (6.35) 
Then one chooses ¢' to be related to € by 


e =e(1+vje+---), (6.36) 


with exactly the same coefficients as in Eq. (6.31). The second and 
fourth terms in (6.35) can then be shown to cancel exactly by making 
the change of variables 


=g(1 +v +), (6.37) 


and using the relation of 8’ to 8 of Eq. (6.8). One can now take the 
limit € > 0 (choosing, say, € = 26 or e = V6) and use the fact that 
B(x) ~ —ba? as x —> 0 to obtain the result (6.32). This proof makes 
plain that the result is non-zero only because of the 1/2? singularity 
of the 1/8 integrands and that the behaviour of (x) away from the 
infinitesimal neighbourhood of x = 0 plays no role. 

The importance of the Celmaster—Gonsalves relation is its 
exactness. It means that, although the A parameter is prescription 
dependent, the A’s of different prescriptions can be related exactly 
by a straightforward Feynman-diagram calculation of the single 
coefficient vı in the relation between the two prescriptions. Hence, 
the A parameter of any convenient “reference RP” can be adopted, 
without prejudice, as the one free parameter of the theory, taking 
over the role of the “bare coupling constant” in the theory’s 
original Lagrangian. This allows us to do the proper “book-keeping” 
when comparing theory and experiment for many different physical 
quantities. If two people choose to use different RP’s as the “reference 
RP,” that is not a problem: one can convert exactly between the A’s 
of the two conventions — and that is true in a practical sense, not 
just in principle. 


By contrast, reporting experimental results fitted to QCD pre- 
dictions in terms of an extracted numerical value for “a"(Mz)” 
creates a quagmire of unnecessary ambiguities. The MS scheme, 
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while convenient for diagrammatic calculations, is an arbitrary 
choice and leaves open the question of how to fix u. The conversion 
from ays(Mz) to the couplant of another RP can only be made 
approximately, with an uncontrolled error — for instance, does 
one express the new a as a truncated series in ays, or the other 
way around? Even to convert the scale from Mz (the Z-boson 
mass) to some other scale is ambiguous: It is not clear how many 
terms of the MS £ function should be used, especially when the fit 
has been made to various quantities, some of which are known to 
higher orders than others. For systematic comparisons of theory 
and experiment it would be sensible to use, say, Ams as the 
free parameter of QCD. If one ever wanted to convert to the A 
parameter of a different reference RP then that conversion could 
be done exactly, with no ambiguity. 

Of course, the value of Ags extracted from even very precise 
experiments tends to have a large error estimate, so one might 
diplomatically adopt the convention to quote, say, the value of 
In(1 GeV/Axs). 


Exercise 6.1. Consider a “delicate” theory in which b = 0, so that 
the 8 function starts at order a3: 


da 
wy, = Ala) = hah + grat gaa’ ++). 
(Assume h is non-zero.) 
(i) Repeat the ’t Hooft argument in Sec. 6.2 to show that h and 
g = 92 — g? are invariants. 


(ii) Show that a suitable constant of integration for the int-6 
equation (6.17) (with lims_,9 understood) is 


1 1 
C(8) = -7 (= = E +116) 


Note that this C(d) is now RS dependent (unlike the usual 
case), though in a simple way: 0C /0g; = 1/(hô). 
(iii) Show that the equivalent to the CG relation is 


~~ 1 
hln(A'/A) = —V2 + gıvı + zY 


so that the A’s of different RP’s are related exactly by a 2-loop 
calculation. 
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(iv) Revisit Exercise 3.2 and show that for a “delicate” theory the 
’t Hooft analysis implies that the bare coupling constant ap 
is of order 4/e. 


(See also Exercise 7.7 at the end of the next chapter.) 
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Chapter 7 


Parametrization of RS Dependence 
and the pn Invariants 


7.1. B Function Coefficients as Scheme Labels 


To make progress we now need to identify the “extraneous variables” 
associated with RS dependence: That is, we need a parametrization 
of all possible choices of RS. At first sight, the task might appear 
hopeless: There are all sorts of ways to define an RS, and they usually 
relate to very technical details of Feynman-diagram calculations, and 
involve lots of arbitrary choices and conventions. Looked at in the 
right way, however, the problem of how to parametrize RS’s has a 
quite simple solution. 
First, recall the int-3 equation, (6.17): 


In(u/A) = lim (= +c()) = ee (7.1) 


Since the integration constant C(d) is RS invariant, it is clear that 
a can depend on RS only through T = b In(y/A) and the scheme- 
dependent coefficients co,c3,... of the 6 function, because these are 
the only RS-dependent quantities (besides a itself) involved in the 
equation. The next step is to observe that a physical quantity R 
cannot possibly depend on any other RS parameters, because the 
cancellation expressed by the symbolic equation (6.2) could not occur 
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if the R coefficients depended on some RS variable that a did not 
depend on. Thus, as far as physical quantities are concerned, these 
variables must provide a complete RS parametrization: 


RS = {7T, c2, €3,...}. (7:2) 


Note the important point that u and A do not enter separately, but 
only through 7 = b In(s/A). 

The importance of this simple argument takes some time to 
appreciate, but it is crucial for all that follows. The reader is urged 
to re-read the preceding paragraph carefully and critically. 


Note that the claim that the parametrization by T,c2,c3,... 
is complete is qualified by saying as far as physical quantities 
are concerned: Non-physical quantities, such as Green’s functions, 
can depend on other aspects of the renormalization procedure. 
That is because Green’s functions depend on the wavefunction- 
renormalization constant, which cancels out in physical quantities. 
Equivalently, we can say that a Green’s function G has an “anoma- 
lous dimension,” oa that will depend on the renormalization 
procedure in ways other than through 7, c¢2,c3,.... This point 
is relevant to the problem of factorization-scheme dependence, 
discussed in Chapter 12. 


Because u and A do not enter separately, but only through T = 
b In(j/A), it is neither necessary nor sensible to fix, separately, the 
RP and the value of u in order to fix the RS. For instance, two 
RP’s that have the same c2,c3,... and differ only in their value 
of A, could be made entirely equivalent simply by redefining the 
u parameter of one of them. (MS and MS are two such RP’s.) As 
shown in the next chapter, “optimization” determines the optimal 
T for a given quantity at a given order, but it does not determine 
an “optimal y;” nor is one needed. (More informally, one might 
speak of an “optimal u,” if it is understood as relative to the A 
parameter of some specified reference RP.) 


In gauge theories some RP’s, particularly momentum- 
subtraction schemes, are “gauge dependent.” One might worry that 
the parametrization by 7,c2,c3,... is then somehow incomplete, 
but that it not so. In such cases, the RP is not fully specified until 
the gauge choice is made. Two such prescriptions, differing only in 
gauge choice, are not a single gauge-dependent RP, but are just 
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two different RP’s — and they are labelled by different values of 
C2, C3,-+ + 


The parametrization of RS dependence by the set of variables 
T, C2,€3,... is not the only possible parametrization (see Exercise 7.2), 
but it does provide a convenient “Cartesian coordinate system.” 
The partial derivative with respect to any one of these variables is to 
be taken holding the other variables of the set constant. That is, 0/07 
is taken with all the c;’s held constant, which corresponds to varying 
the renormalization scale u while holding the RP constant. Similarly, 
0/0c; is taken with 7 and the other c; (i # j) held constant. 

The symbolic RG-invariance equation (6.2) can now be written 
out explicitly as the following set of equations: 


R (ə B(a) ð B 
oF (alt ) mm) ® ~ 
OR ð ð a 
s(a taag) R= EE o oes 
involving some new functions (;(a) defined by 
Oa 
;(a) = og (7.4) 


These (;(a) functions are fixed in terms of the 6(a) function, as we 
show in the next section. 


7.2. The @;(a) and B;(a) Functions 


Consider the int-@ equation in the form (7.1). The left-hand side is 
T/b and depends only on 7, not on the c;’s. The constant C(d) is also 
independent of the c;’s. Therefore, applying the partial derivative 
ae to the equation leads to 


vO [a eh) cm, 


where the first term arises from the c; dependence of the limit 


of integration a, and the second term arises from the explicit c; 
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dependence of the integrand. Rearranging this equation gives us 


a gi +2 


B;(a) = —bB(a) dr —— (7.6) 


for J] = 2,3,.... 

The 8; functions begin at order afti (see Appendix 7.A for their 
series expansions). It is convenient to define B;(a) functions whose 
series expansions begin 1 + O(a): 


t=) 
B;(a)= O] (7.7) 
For j = 1, it is natural to define 
= _ Bla) 2 ` i 
Bı (a) = B(a) = pa? = 1 + ca + ca B ae (7.8) 


with the convention that co = 1 and cı = c. Equation (7.6) can then 
be rewritten as 


B(a)I;(a), (7.9) 


where 
gi? 


(Note that this formula for B;(a) even holds for j = 1 if the right- 
hand side is interpreted as the limit j — 1 from above. See also 
Exercise 7.3.) 

The B;(a) functions have the power-series expansions 


co 


B;(a) = So wje, (7.11) 
i=0 


with wä = 1. The other wj coefficients are fixed in terms of the c;’s. 
A convenient formula expressing that fact can be obtained as follows. 
Differentiating Eq. (7.6) with respect to a (holding the c; coefficients 
constant) yields the differential equation 


8;(a)B(a) — ;(a)8"(a) = —ba?*?. (7.12) 
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(This equation may also be obtained by considering the commutation 
of the second derivatives, 0?a/070c; = 07a/0c;0T; see Exercise 7.1.) 
In terms of the B and B; functions this becomes 


(j — 1)B;B + a( BiB — B;B’) = (j — 1). (7.13) 


Equating powers of a leads to the relation 
i x 
X +9 -1- 2m)cmW} m = (j — 1)6i0 (7.14) 
m=0 
for i = 0,1,2,... and j = 1,2,.... In the special case j = 1, one has 
W; = c; and the above equation reduces to 
i 
S G= 2m)emei-m = 0, (7.15) 
m=0 
which is true identically, since the left-hand side is 
i i 
`> (i — M)CmCi-m — >. MCm,Ci—m, (7.16) 
m=0 m=0 
and the first sum, by changing the summation variable from m to 
n = i — m, is seen to cancel the second. 


7.3. The p, Invariants 


The RG equations (7.3) determine how the coefficients r; of R must 
depend on the RS variables {7, c;}. To show explicitly how this works 
we specialize to the P = 1 case, where 


R=a(l+riatrea*+---) (7.17) 
and write out the lowest-order terms to obtain 
Or > Or 
2°71 3 O72 2 
enn ZA Se Ve 1 )(1T 42 es aaa 
(« ta = ) a (1+ca+---)(1+2ria+---)=0, 


(7.18) 


(oS +a Fete) Fa + Wate (14 ana b) =, 
2 


(7.19) 


124 Renormalized Perturbation Theory and Its Optimization 


and so on. (In fact, the coefficient W? is zero.) Equating powers of 
a, one sees that rı depends on 7 only, while rg depends on 7 and c2 
only, etc., with 


Or, 
—— =], 7.20 
OT ( ) 
Ore Org 
—9 n —=-l1, 7.21 
OT Ae cə ( ) 
etc. Upon integration one will obtain r; as a function of T, c2,...,G; 


plus a constant of integration that is RS invariant. Thus, certain com- 
binations of series coefficients and RS parameters are RS invariant. 
From Eqs. (7.20) and (7.21), one sees that 


p\(Q) =7- 11 (7.22) 
and 
p2 = C2 + r2 — Cri — r? (7.23) 


are invariants. 

The first invariant, p(Q), is unique in being dependent on the 
physical energy scale, Q. (Recall the discussion in Sec. 6.1, and 
Chapters 2 and 3.) A QFT calculation of the coefficient r1, in some 
arbitrary RS, yields a result of the form 


rı = bln(u/Q) +710, (7.24) 


whose u dependence indeed conforms with Eq. (7.20). For dimen- 
sional reasons, the u and Q dependences are tied together in r1. (The 
QFT calculation of rı does not “know” what boundary condition 
will later be applied to the 6-function equation, so the parameter 
A cannot explicitly appear.) Similarly, the higher coefficients ro,... 
depend on ln(u/Q), but not on u or Q separately. Hence, for the 
invariants p2, 93,... the cancellation of u dependence also implies the 
cancellation of Q dependence. However, p(Q) is different because its 
definition explicitly involves 7T: 


p1(Q) = bIn(u/A) — (bIn(u/Q) + P10) 
= bln(Q/A) — rio, (7.25) 
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where A is the A-parameter of the RP in which the QFT calculation 
of r1,. was done. Note that p,(Q) is both u independent and RP 
independent: If a is changed to a(1+v,a+---) then r1, changes, but 
so does bln Å by a compensating amount, thanks to the Celmaster— 
Gonsalves relation. Indeed, we may write 


p1(Q) = b n(Q/Ar), (7.26) 


where Ag is a characteristic scale specific to the particular physical 
quantity R. Knowing r1,o, we can relate it exactly to the A of the 
original prescription. 

Some convention must be adopted to uniquely define the higher 
invariants p; (for j > 2) because, of course, any sum of invariants is 
also an invariant. For example, one might quite naturally add some 
multiple of c? to Eq. (7.23). 


Indeed, a different definition was employed in the author’s early 


papers, with p94 = p3 — Ta and pot = 4p3. In later papers, the 


definition described below was used and distinguished by a tilde 
(“p,”). Here we shall dispense with the tildes. 


To define the p;’s we proceed as follows: For any given physical 
quantity R, one can always define an RS (known either as the “fastest 
apparent convergence” (FAC) or “effective charge” (EC) scheme) 
such that all the series coefficients r; vanish in that scheme, so that 
R = dapc(1+0+0+---). Since the 6 functions of any two RS’s are 
related by Eq. (6.8), we must have 


Bac(R) = =—B(a). (7.27) 


The pn invariants are defined to coincide with the coefficients of the 
EC-scheme $ function: 


Bec(R) = R? Y paR” = —ba? pla), (7.28) 


n=0 


where po = 1 and pı = c (not to be confused with the independent 
invariant p(Q) = T — rı, which will always be written with its Q 
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argument). Rearranging this equation as 


z RY A 
and then equating powers of a yields 
J i+2 
R 1 
ĉj = Vacs (=) + ; (7.30) 
i=0 Oa 


where C,,[F'(a)] means “the coefficient of a” in the series expansion 
of F(a)” 
The first few invariants, for P = 1, are as follows: 
pi=c, and p,(Q)=T-1, 
p2 = & — cry + r2- r, 
p3 = c3 — 2c2r1 + cr? + 4rÌ — Gryre + 2r3, (7.31) 
pa = Ca — 3c3r1 + co(4r? — re) + e(—2r1r2 + r3) 


— 14r + 28r?rz — 5r2 — 12rır3 + 3r4. 


The generalization to any P is as follows: 


ri 
pı 5C, and P(Q) =T- 7, 
1, r2 (P+!) 
paca Oo +S apr Th 
2 
E rı ri 2(P+1)\(P+2) 5, 2(P+2) 2r3 
Pi Cinta lag eee ey 
(7.32) 
Ti (P+ 7) 2 T2 
=e Se ee L 
p4 = C4 oF +o( zp ri 


ie (£ — 1)(2P +5) 3 E n =) 


i- a 


6P3 P 
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es 1P OSE PD) a y (2P + 5)(3P +5) > 


8p4 l 2p3 me 
(3P +7), 3(P +3) 394 
pp 2 pa ST 
Equation (7.30) generalizes to 
J (i+P+1)/P 
R 1 
Cj = dCs (=) J , (7.33) 
i= 


where S = =P Sq An inverse formula giving the p,’s in terms of 
the c; and r; “coefficients is 


R —(n+2P+1)/P 
Pn = Cn | B(a) (=) S?|. (7.34) 


Appendix 7.A: Expansions of the 3;(a) Functions 


The opening terms of the series expansions of the first few 6;(a) 
functions are given below. Note that the B;(a) functions are given by 
the series in parentheses, whose coefficients are the W? of Eq. (7.11): 


Bala) = a? fie E es 
3 6 
18c4 — 9cc3 — 2c2 + 3c? 
4 C4 wh c3 + 9C 2a! + 0(0°)) 
1 c e 6c3 + 2ce — 3c3 
B3(a) = =a (1 — -a + ae + a el + o(a')) : 


3 
c e — 2c 
bala) = aT (1 — 54 + aT, + o(«°)) : 
s(a) = 0° (1- Fa +00), 

Bs(a) = Ža" (1 + O(a)). 


These expansions of 8;j(a) are useful for formal purposes. How- 
ever, the approximate forms needed later are not truncated series, 


128 Renormalized Perturbation Theory and Its Optimization 


but correspond to the definition, Eq. (7.6), with 3(a) replaced by its 
truncated series. 


Exercise 7.1. Show that requiring commutation of the second 
derivatives, 0?a/070c; = 07a/0cj0T, leads to 


B; (a) B(a) — ;(a)B'(a) = —ba?*?, 


where the prime indicates differentiation with respect to a, regard- 
ing the coefficients c; as fixed. (Note that the coefficients of 6; (a) 
cannot depend on u by dimensional analysis.) Verify that the 
solution of this differential equation is Eq. (7.6). 


Exercise 7.2. Consider a more general way of labelling RS’s 
than the 7,¢2,c3,... “coordinate system.” Make the following 
minimal set of assumptions about these RS labels wy, ue, us,...: 
(i) the dependence of a on uj starts at order aft}, and (ii) these 
parameters are mutually independent. Also assume that u1 = T, 
which loses no real generality since we know that 0a/07T = 8(a)/b 
starts at order a?, and that the renormalization scale u must be 
involved in the RS labelling. Assumption (i) implies that 


— = Nja! (1 + Wia+ Wa? 4 e) , 
with some normalization N; and coefficients Ww . From assumption 
(ii) we must have 


Oa E Oa 
Orduj  OujOr 


Show that this leads to the conclusion that 


Ocj-i -0 
Ou; B 
Oc; 
—L=N,(j-1 
Ou; jl ), 
and 
Cji i À 3 zj 
Du; O a 


for j = 2,3,... and i =1,2,..., with cı =c and co = Wf = 1. 
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Note the important conclusion that c; is linearly dependent on uj. 
Check that if one chooses uj = cj then the W? coefficients reduce 
to the W? coefficients of the B;(a) functions. See Eqs. (7.11) and 


7 


(7.14). 


Exercise 7.3. Show that the RG equations for the c;’s, written in 
terms of the B;(a) functions 


(= a)t!B(a) 8 ) R=0, 
ocj 


Gj-1) ða 
can be taken to apply even for j = 1 if we consider “cı” to be the 
invariant c plus an infinitesimal part proportional to 7, in the sense 
that 


a 


“ce,” = lim (c — (j — 1)r) 
jol 
so that 
o 1 o 


da G- Hər 


(Note that, as discussed in Sec. 7.2, the formula for B; naturally 
leads to the identification of Bı (a) with B(a).) 


Exercise 7.4. Find the first three coefficients in the conversion 
between the couplants of two RS’s, a and ã: 


à = a(1 + Via + Vea? + Vza? +---). 


The two RS’s are labelled by T, c2,c¢3,... and T, C2, ¢3,..., respec- 
tively, where 7 = bIn(M/A) and 7 = bIn(M/A), and M and M 
are the renormalization-scale choices in the two schemes. (Without 
loss of generality, we can assume that the two RP’s are defined so 
that their A’s are the same.) 


Solution: The result will be needed in Chapter 12. Vi is easily 
found from considering the 7 and 7 derivatives. The other coeffi- 
cients can best be found from the relation (see Eq. (6.8)) between 
the two 8 functions: 


ta) = F(a). 


Indeed, the calculation is algebraically the same as that for finding 
the p invariants from Eq. (7.27), with the replacements R — å, 
pi — čj, and r; > Vi. Thus, we may use Eq. (7.30) with those 
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replacements, and then rearrange to solve for V;. The first three 
are 


Vi=T-T, 
Va = (T — 7)? + cfr — 7) — (c2 — ĉo), 

bai 10 355) 7 x. z 
V3 = (T— 7) zel F) + (—2c2 + 3č2)(T — ř) — 3 (63 — 3). 


Note that V; is independent of all c;’s with j > i. Note also that 
the 7 and 7 dependence is only through the difference T — 7, so 
that the V;’s depend on M and M only through the dimensionless 
ratio M/ M, as required dimensionally. However, note that the 
dependence on the cj, čj is not always through the difference c;—¢;. 
This fact, first seen in the cz, C2 dependence of V3, might seem oddly 
asymmetrical, but is in fact necessary for overall a + à symmetry. 
Reversion of a power series gives the inverse relation: 


a= à(1 + Vă + Vea? + Va +---), 


with 
Y=-Vi 
Vo = —Va + 2V7, 
=V; + 5V2V; — 5Vř. 


= 
Il 


One can check that, after substituting the above results for 
Vi, V2, V3, one indeed finds that Vi, V2, V3 are given by those same 
expressions with all tilde and plain variables exchanged. 


Exercise 7.5. Consider the successive logarithmic derivatives of a 
physical quantity R = aP(1+r,a+---) with respect to the physical 
energy scale Q: 


_ iRing] 
Riny =Q JQ 


for n = 1,2,3,..., where Ry) = R. These must all be physical 
quantities themselves. 


(i) In a fixed RS, the Q dependence comes only from the r; 
coefficients’ dependence on Q/ju. Use the u RG equation to 
show that 
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(ii) Apply the result again to get Rj) and hence show that the 


quantity 
Ry3] Te 
=B 10 7 
= Rig IIE Q 
is given by 
dB PR JOR 
Y= Ga gef a 


(y is an “effective exponent” in the sense that R is locally 
described, in the near neighbourhood of some particular energy 
Q, by a power law R ~ const. + CQ7. It is particularly 
interesting in the infrared limit, as we discuss in Chapter 11.) 


Exercise 7.6. (i) Continuing from the previous exercise, find 
the first few perturbative coefficients of Rj and hence find 
the invariants associated with it. Show that these are simply 
combinations of the usual p; (Q) and pn invariants for the original 
R. (Note that Rj} has a different leading-order power from R, 
namely P + 1 instead of P.) 


(ii) The result is rather simpler for Ry)/R, which is the scale 
dimension of R: 


Show that the invariants for D are related to those of R by 
PT (Q) = P(Q) -6 
p? = 2p2 — ac, 
p? = 3p3 — 8cp2 + 5c’, 
pr = 4p4 — 14cp3 — 6p3 + 30c?p2 — 14c’, 
for any P. (These last results are relevant in Chapter 12.) 


Exercise 7.7. Continue considering the “delicate” theory with 
b = 0 from Exercise 6.1. 


(i) Show that the u RG equation for R gives wOr,/Ou = 0. Thus, 
as one may also see from the discussion in Sec. 3.6, there are 
no ultraviolet divergences at next-to-leading order. 
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(ii) Show that the RS parameters can be chosen to be 


gı with oe = O(a°?), 
Ogi 
— X 4.1, Oa = B(a) = 3 
T=h ln(u/A) with s ~ O(a’), 
Oa 
. — j+1 
93, J4,- with Dor O(aJ™*), 


and that the analogues of the p, invariants, defined via the 
EC £ function, are 


0i = gi = Ti; 
T2 = g2 — 2171 +177, 
03 = g3 — 392r1 + 491r? — gre — 2rire + r3, 


and so on, together with a Q-dependent invariant 


~ 1 
o2(Q) = Aln(u/A) +r2 = ri — ae 
Note that o2 — o? = g, where g = g2 — g? is the invariant 
found in Exercise 6.1. Note also that g2 is not an independent 
scheme parameter; it is fixed in terms of gı by g2 = 02 — 


24 2 
oy + gi- 
(iii) Show that the analogues of the B;(a) functions, defined now 
as 
_ (j= 2) Oa _ 
B;(a) = git ig 
are 
“da ((1+ 2912) 
B =B 1- —= | — = - 1 
i= 80) (1-0 fs (Saar): 
and 
(j — 2) “oa . 
; > 
B;( ) qi-2 ( ) i ae Re (j 3), 
where 


rf 
t x 


B(a)=1+ gia + goa 


Since gə is not an independent variable, there is no B2(a). (In 
a sense it is B(a).) 


Chapter 8 


Finite Orders and Optimization 


8.1. Finite-Order Approximants 


So far the discussion has been at the level of formal power series. We 
now need, for the renormalized series case, something corresponding 
to the notion of the partial sum of an ordinary power series. Here 
two truncations are involved, for R itself and for 6. A definite 
approximation to 8 is needed because it is the int-G equation that 
relates a back to the theory’s one free parameter, the A parameter 
of some reference renormalization prescription. 

The (k+ 1)th order, or (next-to)*-leading order (N*LO) approx- 
imant is naturally defined with both R and 6 truncated after k + 1 
terms: 


RO Sa" trme, (8.1) 


where a here is shorthand for a+), the solution to the int-G 
equation with 8 replaced by B+): 


B+) = —ba?(1 tear + cpa"). (8.2) 


It is straightforward to check that the order of the error term 
R —R&+) is determined by whichever truncation, of R or £, is 
the more severe. Thus, it is natural to use the same number of terms 
in each. See Exercises 8.1 and 8.2. 


This is an open access book chapter published by World Scientific Publish- 
ing. It is distributed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 (CC BY-NC) License. 
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Other types of approximant can, of course, be defined: For 
example, one could use Padé approximant forms for R and/or for 
B. The issue of the RS dependence of such approximants, and 
how to “optimize” them, would require a fairly straightforward 
generalization of the following discussion for truncated power series. 


8.2. Optimization in Low Orders 


While the exact R is RG-invariant, the finite-order approximants are 
not, since the truncations spoil the cancellations in the RG equations 
(7.3). If R in those equations is replaced by REI) then the right- 
hand side is not zero but is some remainder term O(aP**+!). The 
idea of “optimized perturbation theory” is to choose an “optimal” 
k+1) is stationary with respect to RS 
(k+1) satisfies the RG equations, 


RS in which the approximant R< 
variations; i.e., the RS in which R 
(7.3), with no remainder: 


OT b Oa 


a 


( AR+)| Bla) s 
——— ee + a 
opt.RS 


AR (E+) OR (k+)) 
ka + Bi(a)—— =0. j7=2,...,k. (84) 
J opt.RS 


a 


We assume here that the QFT calculations of the R and 
6-function coefficients up to and including rọ and cg have been done 
in some calculationally convenient RS. From those results, the values 
of the invariants p(Q) and pı = c and p2,..., px can be obtained. 
The “optimized” result can be expressed solely in terms of those 
invariants, and thus has no dependence whatsoever on the choice of 
RS used for the Feynman-diagram calculations. 


First order (leading order), as mentioned in Chapter 2, is only a 
qualitative approximation. It is monotonic in the RS variable r and 
so is not optimizable. There are no invariant quantities, besides b. 
One cannot do better than to guess at a suitable u “of order Q.” 
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Let us now consider the second-order (next-to-leading-order) 
approximant: 


R® =a” (1 + ria), (8.5) 
B® = —ba?(1 + ca), (8.6) 


where a here is short for a), the solution to the int-8 equation (6.24) 
with 6 replaced by 6): 


ca 
l+ca 


r= Ka) =2+eln . (8.7) 


Since R) depends on RS only through the variable 7, only the 
“j = 1” equation (8.3) above is non-trivial. Thus, the optimized R®) 
is determined by a single optimization equation: 


an at! — g2(1 + ca)(pa’—! + (P+1)ra?) = 0. (8.8) 
= 


(Overbars are used to indicate the value in the optimum RS.) As 


P+1 


discussed in Sec. 7.3, the a terms must cancel in any RS, which 


fixes ors = P, leaving 


P — (1 + ca)(P + (P + 1)ria) = 0. (8.9) 

This equation determines the optimized coefficient 7, in terms of the 
invariant c and the optimized couplant a: 
P c 


= ea ee (8.10) 


But 7; is related to 7 by the definition of the p(Q) invariant in 
Eq. (7.32): 


pi(Q) =7-—- (8.11) 


Eliminating rı between these last two equations and substituting into 
the second-order int- equation, (8.7), gives 


al ca 


ter) aQ. 812) 


al = 


(1 + caln 


Cc 
1+ca 
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If we are interested in a specific Q value then we need to solve this 
equation numerically for @ (to a precision comfortably better than our 
estimated error for R). We can then substitute back in Eq. (8.10) 
and hence obtain the optimized approximant R) = aP (1 + 714). 
However, if we are interested in a range of Q then we may just pick 
many @ values, in a suitable range, and evaluate the left-hand side 
to find the p,(Q), and hence the Q value, corresponding to each of 
those a’s. 

Note that the only approximations made here are the truncations 
of the R and $ series, leading to Eqs. (8.5), (8.7) that define 
the second-order approximant is some general RS. We do not, for 
instance, approximate Eq. (8.10) as 7 ~ =F? (which corresponds 
to the PWMR approximation, discussed later). Nor do we make some 
uncontrolled analytic approximation to Eq. (8.12). At this order 
optimization, in practical terms, is no more complicated than the 
fixed-RS case, where we would have the int- equation to solve. 

We now turn to third order. The third order approximant is 


R® =a” (1 + ria + r2a°), (8.13) 


where now a is short for al), the solution to the int-6 equation 
with 8 truncated at third order. R) depends on RS through two 
parameters 7 and c2, so there are two optimization equations coming 
from Eqs. (8.3), (8.4). Those equations will mirror Eqs. (7.18), (7.19), 
generalized to general P. Using the counterparts to Eqs. (7.20), (7.21) 
they become 


1+ (£ i Da + c) a— B® (a) ( + C+), 4 4 Cina) = 0, 


P P 
(8.14) 


1- Boa) (1 + ed) t Daa + a] t “la =0. (8.15) 


Here BO) (a) is the B(a) function truncated at third order in the 
optimum scheme: 


B® (a) = (14+ ci + &a?), (8.16) 
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while the other function B® (a) is obtained from Eqs. (7.9), (7.10) 
with B(x) replaced by (1+ca+é@ x"). (Note that the only truncations 
made are the initial truncations of the R and B series that define 
the third-order approximant.) The two optimization equations can 
be combined to yield equations for 7; and Fə in terms of the integral 


-(3)_ fl dx 
L- = —— 8.17 
2 f (1 + cx + G27)? nn 


and @ and CG). Combining those equations with the definition of the 
invariant p2 will then determine Co in terms of a. Finally, a itself can 
be determined by combining the definition of the p,(Q) invariant 
with the int-3 equation. In the next chapter, we will explain a 
systematic method for solving the optimization equations in third 
and higher orders. 

Note that the “optimal RS” is not the same from one order to 
the next; for instance 7; at third order is not the same as 7; at second 
order (so, strictly we should have distinguished r” and 7) in the 
above). 


8.3. Perturbative Approximants as a Function of the 
RS Variables 


The following discussion is not essential, but the reader may find it 
helpful to see pictures of the low-order approximants as a function 
of the RS variables — akin to the figures in Chapters 4 and 5. (For 
illustrative purposes, we choose the case studied in Example 3 of 
Chapter 10, but the qualitative features are generic.) 

The (P = 1) second-order approximant R?) = a(1 + ria), at any 
given Q, is a function of just one RS variable, 7. The coefficient rı 
depends linearly on 7 and is T— p(Q), while a is a function of r found 
by inverting the second-order int-8 equation, Eq. (8.7). Figure 8.1 
shows R€?) as a function of r. The single maximum corresponds to 
the optimized result. 

One can avoid the inversion step by simply using the int-8 
equation to swap the variable 7 for a itself. (This may be a little 
mind-boggling at first, but is mathematically quite straightforward.) 
The approximant R) can then be expressed explicitly as a function 
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Fig. 8.1. The second-order approximant as a function of the RS variable 7 in 
the case corresponding to Example 3 of Chapter 10. 


of the new “extraneous parameter” a as 


R) (a) = a(1 + (K (a) — p,(Q))a) 


=a (2 +caln (=) = p: (Q)a) . (8.18) 


It is nicer to plot this against 1/a, rather than a itself; the qualitative 
behaviour is then similar to the previous figure. Figure 8.2 shows 
a close-up of the region near the maximum. The optimized result 
corresponds to the maximum of this curve. Thus, in this sense, the 
optimization condition is dR/da = 0. It is for this reason that we 
write OR/Oa for the derivative taken with the r; coefficients held 
constant. The total derivative dR /da here takes into account that the 
rı coefficient depends on 7 — and hence, from the current viewpoint, 
is a function of a. 

At third order there are two RS variables 7 and c2. It is again 
mathematically convenient to swap 7 for a. Now we must distinguish 
the usual OR/0a from OR/Oa),,, which must take into account the T 
dependence, and hence a dependence, of 71,72. Similarly, OR/Oc2|,, 
is distinct from the usual OR/Oc2, which is at constant T. (See also 
Exercise 8.3.) 

Figure 8.3 shows the third-order approximant RC) as a function 
of 1 and cg. For a fixed co value the curves generically have two 
stationary points or none. However, the optimal result, stationary in 
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Fig. 8.2. As the previous figure, but with 1/a used as the RS variable, and 
zooming in on the region near the maximum. The optimized result is 0.106 with 
an error estimate of +0.011. 
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0.080, 2-3 “49 12 14 16 
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Fig. 8.3. The third-order approximant as a function of the RS variables 1/a, c2, 
in the same case, and shown on the same scale, as the previous figure. The curves 
are for co = 0, i, I; 3 times the optimal c2 value, which is —15.9. The optimal 
+, c2 are indicated by the black dot and give the optimized result R® = 0.095, 
with an error estimate of +0.005. 


both 1/a and c2, is unique and corresponds to a saddle point of the 
function. 

It is instructive to compare this figure with those of Chapter 5. 
One sees the pattern of diverging, flat, and overdamped regions — 
and their characteristics show up both going from one order to the 
next and in the dependence on the other extraneous parameter, Co. 
Also note that the optimal value of 1/a has increased relative to the 
previous order, in accord with the induced convergence phenomenon. 
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8.4. The Optimization Equations 


We now formulate the optimization equations at some general, (k + 
1)th, order. It is convenient to define 


1 oR 


eee ES .1 
paP-1 ða’ (B10) 


whose series expansion 
S=14+sja+sga7+--- (8.20) 


has coefficients 


Sm = (==) rm. (8.21) 


P 


As will be seen below, the use of the Sm coefficients absorbs all the P 
dependence of the optimization equations. (However, P will reappear 
later when we need to combine their solution, obtained in the next 
chapter, with the P-dependent pn invariants.) 

Generalizing the discussion in Sec. 7.3 to any P, it must be true 
that all terms in the RG equations up to and including O(aP+*) 
cancel automatically in any RS. In the first optimization equation, 

AR (k+1) g A R 
(8.3), the =S5— | term is a polynomial which must cancel the first 


k terms of ae A similar observation applies to the other 


optimization equations, (8.4). Hence, we may reduce the optimization 
conditions to 


By) @SOM (a) — Tr BY @SEM@|!=0, (8.22) 


for j = 1,2,--- ,k, where, as in Sec. 5.4, the notation T,,[F'(a)] means 
“truncate the series for F(a) = Fo + Fia+--- immediately after the 
a” term” (i.e., Tp[F'(a)] = Fo + Fia+---+ F,a”.) 

For future reference, note that the 7 = k equation, where the 
To|...] term is just unity, gives 


StD (a) = ———. (8.23) 
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We emphasize again that the BEM (a) functions are not poly- 
nomials, but are given by Eqs. (7.9), (7.10) with B(a) replaced by 
B+) (a) =1+cat---+qa*. 


Exercise 8.1. Verify that the R and £8 series should be truncated 
after the same number of terms to have a formally consistent 
approximation. As discussed in Sec. 6.4, the int-3 equation T = 
K(a) can be used to express a as a series in 1/7 (with coefficients 
involving logarithms of 7). By using p,(Q) = T — 11 /P, one can 
then express a, and hence R, as a series in 1/p,(Q). Verify that 
the number of valid terms in the resulting series is controlled by 
whichever truncation, of R or 3, is the more severe. 

Show explicitly that if two terms of 8 are kept and at least two 
terms of R are kept, then 


P : 
e= (A) ("ase so (9P) 
Pı Pı Pi 


Keeping three terms of 8 and R, and noting that A(a) in 
Eqs. (6.27), (6.28) is (c? — c2)a + O(a?), the result can be extended 
to the next order in 1/p,. For p = 1 the result is 


1 ( clnp,/c | p2+¢? (In? p,/e—Inp,/c— 1) 
= eS Ee 
1 Pi Pi 


+o(28)) 
Pi 


Note that the r1,r2,c2 coefficients only appear in the invariant 
combination p2. 


[This expansion in 1/p,(Q) is suitable for the present formal 
purpose: It is not, however, a solution to the RS-dependence 
problem. The expansion parameter 1/p,(Q), though it is RS 
invariant, is rather arbitrary because the A definition is merely 
a convention. | 


Exercise 8.2. Show that “mixed-order” approximants, where R 
and £ are truncated at different orders, are not optimizable because 
they have a monotonic dependence on one or more of the RS 
parameters T, c2,c3,... and hence have no stationary point. 


Exercise 8.3. Show that the optimized result is independent of 
the particular choice of RS “coordinate system” used: that is, the 
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result for the optimized approximant me is the same whether 
one uses T,C2,C3,... aS scheme parameters or the more general 
parameterization by u1,u2,... as in Exercise 7.2. (This is just 
the obvious fact that the value of a function f(a1,22,...) at a 
stationary point is invariant under changes of variable 71, x2,...§ —> 
v',,x5,..., but it is perhaps instructive to check this.) Consider the 
case of Rl) explicitly; the generalization to any order will then be 
obvious. Show first that the optimization equations OR) /Ou; for 
i > 3 are identically satisfied, and that those for i = 1 and i = 2 
are a linear combination of the corresponding T and c2 equations. 


Chapter 9 


Solution for the Optimized rm Coefficients 
and Optimization Algorithm 


9.1. Definition of the H;(a) Functions 


In this chapter, it is implicit that all quantities are in the optimal RS 
at (k+ 1)th order; overbars and (k +1) superscripts will be omitted. 
Also, we make the convention that 


T9=So=CQ=l, and cj =c. (9.1) 


Next — for reasons that will become clear in the next section — 
we define some functions H;(a),...,H;,(a) that are combinations of 
the B,(a),..., Bx(a) functions: 


= Yaw (4) Bisa i=(1),2,...,8. (9.2) 


For 7 = 1 this definition, as it stands, is ambiguous; it should be 
interpreted as 


Hı (a) = Bı(a) = > cj? Bjala), (9.3) 


corresponding to 
l —  ]= A 
lin (=) al 540. oe) 


This is an open access book chapter published by World Scientific Publish- 
ing. It is distributed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 (CC BY-NC) License. 
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Table 9.1. The H; functions at low orders. (Ho = 1.) 


k Ay Hə H3 H4 
1 B 0 

2 B — ca Bz Bo 0 

3 B- caB2 = c2a? B3 B2 B3 0 
4 B — caBz2 — c2a? B3 — c3a° Ba Bz — Zc2a° By B3 + 3caB4 Ba 


It is also convenient and natural to define 
Ho(a)=1 and Ayyi(a) = 0. (9.5) 


Note that for the case i = k, the definition (9.2) gives H;,(a) = Bg(a). 
The H functions in low orders are given explicitly in Table 9.1. 

In general, the H’s are combinations of the B’s. It turns out that 
there is a simple formula for the inverse relationship, giving the B’s 
as combinations of the H’s. 


Lemma. 
k— or 


Hj+qla a), j = essere shes (9.6) 
q=0 


where the wj coefficients are those of the series expansion of B;(a), 
Eq. (7.11). (One might describe this result as follows: Take the power 
series for B;(a) and truncate it after the a*-J term. Now reweight 
each term, replacing a1 by a7 Hj+4(a), and the result is the full series 


for B;(a).) 


Proof. (We assume j # 1 for the present.) Using the definition of 
the H’s, Eq. (9.2), the right-hand side becomes 


Reorganizing the double sum by defining n = q + p converts this 
expression to 


k-j n 

a a” Byyn(a) . z 
D J n+ 5—1-—2p)c,W?_,. 9.8 
n=0 (j+n-— ape J p) P ' n—p ( ) 
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The inner sum reduces to (j — 1)dn0 by virtue of Eq. (7.14). Thus, 
only the n = 0 term of the outer sum survives, the (j — 1) factors 
cancel, and one is left with just B;(a), as claimed. 

In the j = 1 case Eq. (9.6) becomes 


k-1 
Bila) = > cya? Hg+1 (a). (9.9) 
q=0 


Using Eq. (9.3) for Hı (a) and Eq. (9.2) for the other H’s, the right- 
hand side becomes 


k-11 k—1 k—-q-1 G= 
Bı(a)— `> cja Byala) + 5 Cy? cja? ———~ Bg +541(a). 
æ e A Fa) 
J= q= j=0 
(9.10) 
Reorganizing the double sum by defining n = q + j yields 
k—1 , k-1 a” n 
Bi) ~ F650 Bjala) + > Banla) (20a ndean- 
j=l n=1 q=1 
(9.11) 


The inner sum, in parentheses, after adding and subtracting a q = 0 
term becomes 


n 
NCn + S (%4 — Nn)CgCn—q; (9.12) 
q=0 


which reduces to nc, since the summation term vanishes, as noted 
in Eq. (7.15). Thus, the two series terms in (9.11) cancel leaving just 
Bı(a), as claimed. 


9.2. Formula for the Optimized sm Coefficients 


We are now ready to state the main result; an exact, analytic 
expression for the optimized Sm, and hence the rm = PF Sm, 
coefficients, for m = 0,1,...,k, in terms of the (optimized values 
of) a and the 8-function coefficients c2, ... , Ck: 
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Theorem. 
qt 
Sm = (Hk-mla) — Hk-m+1(0)), m=0,1,...,k. (9:18) 
By (a) 


Proof. In the case j = k, as noted in Eq. (8.23), the optimization 
equation reduces to 
o 1l 
Bi(a) 
We first prove that this equation is satisfied. Substituting Eq. (9.13) 
into the series for S gives 
k 


k 
S= Yo ana” apa Hy-m(a) — He-m4i(a)). (9.15) 


(9.14) 


The H’s cancel in pairs leaving 


(Ho — Hk+1) = = (9.16) 


o 1 
B;(a) 
since we defined Hp = 1 and Hg+ = 0 above. 
Using this result and writing out the truncated-series term 
explicitly, the remaining optimization equations of Eq. (8.22) can 
be rewritten as 


=N N smWi, j=l,...,k-1. 9.17 
Bays do mW d (9.17) 


We now need to prove that these equations are satisfied by Eq. (9.13). 
The right-hand side becomes 


T 


=j i -m 


a YO Wi miga Ela) — He-msi(a))- (918) 
i=0 m=0 


>. 


Reorganizing the double summation, defining q = i — m and thereby 
replacing i with m + q yields 


1 k-j k—-j—q 
aW} X` (Hr-mla)— Hk-m+1(0)). (9.19) 
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The inner summation reduces to Hj4 (a) since the H’s again cancel 
in pairs (and Hyi,; = 0). Thus, the right-hand side of (9.17) 
reduces to 


1 < awi H. L 
Bata) 2: Wi Hj49(a) = Bla)’ (9.20) 


where the last step uses the lemma, Eq. (9.6), and produces the left- 
hand side of (9.17), completing the proof. 


9.3. An Identity and the PWMR Approximation 


It is worth noting the following set of “complete-sum identities:” 


k 
Y cja (==) Bij(a)=1, i= (1),2,...,k, (9.21) 
= i+3j-—1 

with the ¿ = 1 case interpreted using (9.4). The proof is given 
in Appendix 9.A, which discusses various properties of the J;(a) 
integrals related to the B;(a) functions. 

These identities reveal a remarkable property of the H;(a)’s, 
which are defined as a “partial sum” (over j = 0,...,k — i) of the 
same terms. Hence, we can write 


k : : 
fia p=] : 
H;(a)=1-— ea cja? (==) Bi+;(@), i= (1),2,...,k, 
Ekiti J 
J 
(9.22) 


which, unlike the H; definition, involves B;’s with j greater than k. 
Since the B;’s all start 1 + O(a) we see that the series for H;(a) — 1 


begins only at order a‘~**!; 
k—2i+2 7 
H;(a)- 1 = — ria" EHO): (9.23) 


Substituting this result into Eq. (9.13) leads to 


ipa (a yoat- (1 a tena) EEE SN 
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and hence 


k—2 
Sm = — a Oe), (9.25) 


This result was first obtained — in a quite different manner — by 
Pennington, Wrigley, and Mignaco and Roditi (PWMR). The result- 
ing PWMR approximation (see Exercise 9.1) can be useful when 
a <1 (one also needs a < a* if a finite infrared limit a* exists) and 
it provides a good starting point for the full optimization procedure. 
(Obviously, post-PWMR, post-post-PWMR, ... approximations can 
be obtained by expanding further in powers of a; see Exercise 9.3.) 


9.4. Results in Terms of the I; Integrals 


Recalling that the B;(a) functions are related to the J;’s by Eq. (7.9), 
we may rewrite the H’s as sums of J’s rather than B’s: 


k—i 


B(a) 
= X (i-j- Dejlige T= 2k; (9.26) 
j=0 
and 
Hy (a) = Ba) | 1- X jel J. (9.27) 


Note that at (k+1)th order only Io,..., Ip arise. The Sm coefficients, 
from Eq. (9.13), can thus be expressed in terms of the J;’s (see 
Exercise 9.3). The result, defining c_; = 0, is 


1 ld 
Sm = E DG 2 -mi (œ -m -— j- le; 
j=0 
| a j=l 
(k-m—j+1)= ), (9.28) 


for m = 1,...,k — 2. The cases m = k — 1, k, involving H1, Ho, need 
special treatment. Those results are 


_ 1 — l ss : Cj-1 
Sk-1 = -Dh t= dT (ic (J 24) (9.29) 
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Table 9.2. The sm coefficients at low orders, in the terms of the J; integrals. 
B= ame Cj a’. 


k Sm coefficients 


1 8i =-$. 
2 na-e-d 4d, n=- 
3 si =-i+ 5, s2 = —c2 + yy (a — (1+ ca)I2), 
s3 = 2 + qy (5 -~ 1+ ch). 
4 9 =g-14 3, s2 = — 29 4 (ale — 21s), 
s= (eases) a son s4= B+ T (4 —1+chk + 2ceI3). 


(a — (1 + ca) Iz — 2c2a 13), 


and 


k-1 
1 1 
eee | Si el ax W. 
“= hak \B tna ia (9.30) 


For low orders, the results are collected in Table 9.2. The J; integrals 
are readily evaluated by computer algebra; see Table 9.3. Properties 
of the I’s are discussed in Appendix 9.A. 


9.5. Optimization Algorithm 


The optimization problem at (k+1)th order involves 2k+1 variables, 
namely, a, T,c2,...,Cp, and r1,...,7;. These are connected by 2k+1 
equations, namely, the int- equation, the k optimization equations, 
and the k formulas for the invariants p,;(Q) and p2,..., ppg (whose 
numerical values we assume are given — in the case of p,(Q) the 
numerical value will depend on the value of Q being considered). 
We shall use a,c2,...,Cķ as the principal variables. The solution 
to the optimization equations just discussed gives the coefficients 
r1,---,7~ directly in terms of the principal variables. The int-@ 
equation explicitly fixes 7 in terms of the principal variables. The 
pı(Q) = T — rı equation can be used at the end to relate a to 
Q, so the remaining task is to use the formulas for po,...,p% to 
determine, self-consistently, by some convergent iterative procedure, 
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Table 9.3. Mathematica notebook to evaluate the I2,J3, and A 
integrals for the k = 3 case. The output expressions are not shown for 
reasons of space. (To avoid ConditionalExpression forms, it is tacitly 
assumed that a is less than the smallest positive root of B(x), as must 
be true in the present context.) 


Inl # $Assumptions = {c € Reals, c2 € Reals, c3 € Reals, a > 0}; 
In| # B=1+cx+c2 x^2 + c3 x43; 

In| # int2 = Integrate [ 545, x] ; 

Inl # Ii2 = (int2 /. x > a) — (int2 /. x > 0) 

Outl#] — 

In| # int3 = Integrate [sx] ; 

In| # li3 = (int3 /. x > a) — (int3 /. x > 0) 

Outl#] — 

In| # Deltwint = Integrate[(1/x*2)(1/B — 1+ ¢ x), x]; 

In| # Deltw = (Deltwint /. x > a) — (Deltwint /. x > 0) 


Out([#] = 


the c2,...,c, variables. One such algorithm is the following (recall 
— (P+m . 
Sm = ( 5 JT: 


(1) Choose a numerical value for a. 

(2) Make an initial guess for the numerical values of 51,... , Sx. 

(3) Find values for the c;’s from the invariants using Eq. (7.33). 

(4) Obtain new values for the Sm from the formulas of the preceding 
section. 

(5) Iterate from step 3 until the results converge to the desired 
precision. 

(6) Finally, use 


p1(Q) = Ž + eIn [eal sA (=) 5 (9.31) 


(from the definition of p(Q) as T — rı combined with the int-6 
equation T = K(a) in the form of Eq. (6.27)) to find the value of 
Q that corresponds to the chosen a value. One can then repeat the 
whole procedure with different initial a values to cover the desired 
range of Q values — or to home in on one particular Q value. 
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It is usually convenient to start at a very small a value and use the 
PWMR approximation (see Exercise 9.1) for the initial sm values. 
One can then move step-by-step to larger a values (and hence lower 
Q’s) using the previous sm’s as the initial guess for the next step. 

To illustrate the algorithm we show, in Table 9.4, a Mathematica 
notebook for the fourth-order (k = 3, P = 1) case. The desired values 
of c, p2, p3 and a should be entered in place of those used there (which 
correspond to Example 3 in Chapter 10). 

Various details of the algorithm can be refined. It appears to be 
quite robust and efficient except in the far-infrared region, where 
more elaborate numerical analysis techniques may be needed. The 
infrared limit can be analyzed analytically, as will be discussed in 
Chapter 11. 


In the case of a fixed-point limit, where the 6 function has 
a zero at a = a“, one can avoid the worst problems at low Q by 
iterating at a fixed B(a) value rather than at a fixed a value. That 
is, after step 3, one constructs the new B(x) function and solves 
for a new a from B(a) = Bo, where Bo is the value obtained in the 
first iteration. 


Is the optimal solution determined uniquely? This seems 
very likely, due to the fact that k extraneous parameters are 
involved at (k + 1)th order. A general proof appears difficult, 
but at large Q, where the PWMR approximation is good, the 
uniqueness can be proved easily: The PWMR cj coefficients are 
fixed directly by the values of the ppn invariants (see Exercise 9.1), 
leaving Eq. (9.31) to fix the optimal a. The right-hand side is easily 
seen to be a monotonic-decreasing function of a, while the left-hand 
side is a-independent; hence, their intersection point is unique. 


Appendix 9.A: Properties of the I; and J; Integrals 


In this appendix, we discuss some properties of the integrals 


Tia fa Ja fa = (9A.1) 
(a) = L——,, (a) = r=—. : 

i o B(x)? : o B(x) 

It should be understood that we are considering (k + 1)th order, so 


that “B(x)” is really shorthand for B@+) (x) =1+ca+---+ cpr". 
Unless otherwise stated, we assume 7 > 2, so that the integrals 
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Table 9.4. A basic program to implement the optimization algorithm for p = 1 
at fourth order (k = 3). Expressions for [2,J3, A should be cut-and-paste into the 
opening lines from the outputs of Table 9.3. 


Ini #] 


li2 = ...; [i3 = ...; Deltw = ...; 


In[ #] 
Int #] 


Inl #] 
In[ #] 


In[ #] 
In[ #] 
Inl #] 


In[ #] 


In| #]) 


In[ #] 


Out[#] 


In[ #] 


c = 115/58; p2 = —9.92498; p3 = —115.21021; 

c2frinv = p2 — (r2 — crl — r1?) /. {r1 > s1/2, r2 > s2/3}; 
c3frinv = p3 — (—2c2r1 + crl? + 4r1? — 6rl r2 + 2r3) /. 

{rl > s1/2, r2 + s2/3, r3 — s3/4}; 

reslist = {}; 

(* Initial guesses for s1,s2,s3. PWMR used here. *) 

sl = $; s2 = —3 (p2+ 4c’); 

s3 = —2 (p3 + 4p2c+ gz); 

(*%%%%*) 

aa = 0.0899359; n = 0; 

(*#HHH*) 

n=n+ l1; 

c2 = c2frinv; c3 = c3frinv; 

Bval = 1 + c a + c2 a^2 + c3 a^3 /. a — aa; 

li2val = Refli2 /. a— aal]; 

li3val = Re[fli3 /. a > aal; 

sl = —1/aa + Ii2val/(2 li3val); 

s2 = —c2 + (aa — (1 + caa) li2val)/(2 aa li3val); 

s3 = c2/aa + (1/Bval — 1 + c li2val) /(2 aa Ii3val); 

RR[n] = a(1 + (s1/2)a + (s2/3)a*2 + (s3/4)a*3) /. a aa; 
(*#HHH*) 

(* Evaluate the preceding cell several times until satisfactorily converged. 
Use ListPlot[Table[RR{i], {i, 1, n}]] to view convergence. *) 
tholQ = (1/a + c Log [Abs [c a] ] — Re[Deltw] — s1/2) /. a — aa; 
AppendTo [reslist, {aa, rholQ, RR[n], c2, c3, s1, s2, s3}] 
{{0.0899359, 6.02383, 0.0913479, —7.42689, —230.263, 
—0.125903, —7.85684, 233.94} } 

(*Return to (*%%%%*) and enter a somewhat larger aa value, 


thereby moving to lower Q. *) 
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are convergent. These integrals may be evaluated analytically by 
expressing the polynomial B(x) as a product of its factors and using 
partial fractions. (The resulting expressions are cumbersome, and 
complicated by the possible presence of complex roots in pairs, but 
are readily handled by computer algebra programs. Note that, in the 
present context, a will naturally always be smaller than any positive 
root of B(z).) 
The J’s are given by a sum of I’s: 


k 
J; = ~ Cj Tags (9A.2) 
j=0 
The proof is simple: The right-hand side is 
k a i+j—2 a i—2 k 
x x , 
c; | de= = f dr—— ) G, (9A.3) 
def ae ~ hae 


and the sum gives B(x), which cancels with one of the B(x) factors 
in the denominator, so that the integral reduces to Jj. 
A set of “complete-sum identities” follows from the fact that 


a d git at! 
oe RRS DEER Ad 
e a - Be paa 
The left-hand side is 


a l gi-2 git dB k l a pitj—2 
[e (6-05a Bape) TED- dis f BGP 


k k 
=(i=1) $ clas- Do jolas 
j=0 j=0 


k 
=) =j- Igla QAD) 
J=0 


Thus, we obtain a set of complete-sum identities: 


k 
X (i -j - 1)cjIitj = Bla’ i= (1),2,3,.... (9A.6) 
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In the special case 7 = 1, the result should more properly be written 
as 


k 
X (-i)c; Lj = = Fao (9.7) 


j=l 


where the sum starts with 7 = 1. (As a mnemonic, one can regard 
the 7 = 0 term, which has a vanishing coefficient times a divergent 
I, as producing a 1 that is taken to the right-hand side.) 

Recalling that the B;(a) functions are related to the I;’s, we can 
rewrite the complete-sum identities, Eq. (9A.6), in terms of B;’s to 
get 


ae atil Bijla) ah 
Da-i- pa ae 9 


which, when divided by a’~!/B(a), gives the result in Eq. (9.21). 
(The i = 1 case needs special consideration, but can be easily 
checked.) 

In the factorization-scheme-dependence problem, discussed in 
Chapter 12, we also encounter the integrals Jķ}1 and Isķ4}ı at 
(k + 1)th order. These can also be reduced to combinations of 
Ig,...,I,, by the earlier formulas plus the identity 


y Jgd = ln B(a), (9A.9) 
which follows from 
k 
a 1 ar a j-1 
pags l- Pare Soe = Die ears 
j=l 
(9A.10) 
For cases involving I; or J; for 7 = 1 or 0, we may define 


“regulated” versions of the integrals by subtracting off the inverse 
powers of x in a Laurent expansion of the integrand. However, the 
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only such case we really need mention is 


A i = ia 1 1 — cx 


which is the integral occurring in the int-8 equation (see Eqs. (6.27) 


and (6.28).) In practice, this integral is best evaluated directly, but 
it is noteworthy that it can be expressed as a sum over convergent 
J integrals: 


z a 1 k i 
A= f EE (1-0-6 a) 


i=0 
j 1 : } k+1 
= i 4 amg e 
| E 2a 1 — G )L’ + CCRT 
k—1 
=5 i Cj = Cj+1) Jj+1 + CCkJk+1- (9A.12) 
j=1 


The last term, by using the J complete-sum identity, Eq. (9A.9), can 
be expressed as 


k-1 
C r 
cckJi+1 = 7 | n B(a) — 3 jcjJjy |, (9A.13) 
so that 
k-1 _ 
A= Ein Bla) +> — J) coy = oy] ia. (94.14) 
J= 


Note that the J’s involved can be expressed, using Eq. (9A.2), as 
a sum of J;’s with j = 2,...2k. Then, by using the complete-sum 
identities to substitute for Iķ+1,..., I2ķ, one can reduce the result to 
a sum over just [g,..., Ip. (See Exercise 9.4.) 


Exercise 9.1. Combine the PWMR result, Eq. (9.25), with the 
formulas for the p; invariants to solve for the PWMR. rm coefficients 
in terms of the invariants for second, third, and fourth orders. 
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For Pp = 1 you should find rı = =a at second order; rı = 0, 
T= -302 at third peder and ry = aC, Py = sali aac ae 
r3 = —5 i (ps + {p2c + a re è) at fourth order. 


Exercise 9.2. Express the formula for the sm coefficients, 
Eq. (9.13), in matrix form, with the column vector of 8,,’s (with 
m = Sma™ and 8) = 1) given by 1/B, times a matrix of 0’s and 
1’s times a column vector of H;’s. Then find the matrix inverse. 
Substitute in the lemma, Eq. (9.6), to show that the column vector 
of (1, B, Bo,..., Bp) is given by the matrix 


1 1 1 seer ol, dl 
Ty-1(B) Tk-2(B) Tr-3(B) ... 1 0 
B mP 
k | To(Bk-2) Tı(Br-2) 1 0 0 
1(Br-1) 1 0 0 0 
1 0 0 0 0 
times the column vector (1, 81, §2,..., p). 


Exercise 9.3. 


(i) Derive Eq. (9.28). 
(ii) Obtain Eqs. (9.29, 9.30) for the special cases sx and s,_1. 
(iii) Use the complete-sum identities (94.6), to obtain the formula 


1 
Sm = t-i (k — 2m) jad bp 3 Cj 
j=m+1 
r Le-m+j+1 : 
x (k -m = J) — (k= m- j — 1)Ik-mj , 
which gives compact results for Sk, Sk—1,..., but is more 
cumbersome for s1,52,.... Note that it involves integrals 


beyond Iķ, up to Igx-m+41 (and so up to Ip, for m = 1). 
The first term directly yields the PWMR approximation. 

(iv) Use the series expansion of the J;’s to obtain the post-PWMR 
approximation 


_ (k— 2m) i 2ca 2(m+ 1) 
Sm = k Cm k(k +1) K(k -+ 1) Cm+14. 
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Exercise 9.4. From Eqs. (9A.14) and (9A.2) show that for k = 2 
x c e 
A= 3 n B(a) + (> = o) (Ip + cl3 + C214). 


Using the complete-sum identities of Eq. (9A.6), eliminate Jz and 
I4 and hence express A in terms of Ig: 


3 2_9 
A=<in TTE (e — 4c2)h — (c? — 2c2 + ceza) ). 
2 Aco 


B(a) 
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Chapter 10 


Numerical Examples for Re+e- in QCD 


10.1. Re+e- and Its QCD Corrections 


We turn next to illustrative numerical results for a specific physical 
quantity, namely the QCD corrections to the ratio 


Ree- = al e ees): (10.1) 
o(ete” => utu) 


In the parton model, where quarks are treated as free particles, the 
cross section to produce a quark-antiquark pair, of a specific flavour, 
is exactly the same as that for production of a wtp pair, except 
for a factor of the quark charge squared, q?, and a factor of 3, since 
each flavour quark comes in three colours. It is assumed that the 
quarks will later “fragment” into hadrons. How that happens does 
not matter; it happens with probability one, and does not affect the 
cross section, which is determined by the probability of producing 
the qq pair in the first place. Thus, neglecting masses, the parton 
model prediction is just 


A as a: (10.2) 
i 


where the sum is over all flavours. 
Quarks, however, are not free; they may radiate gluons and 
exchange virtual gluons, giving rise to QCD corrections. The 
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leading-order correction has a coefficient of unity times the QCD 
couplant, a, so we may write 


Rete- = (Ze) (1 + Rete-); (10.3) 


where Re+e- is a normalized physical quantity whose QCD pertur- 
bation series has the form 


Rete- =A(1+r1a+r2a7 +--+). (10.4) 


It depends upon the energy scale Q by the magic of Dimensional 
Transmutation, as discussed in Chapter 2. (In this specific context, 
we choose to define “Q” as the total ete” centre-of-mass energy.) 
The coefficients r1, r2, and r3 have been calculated, as have the 
G-function coefficients c2, c3 (all in the MS scheme), and are quoted 
in Appendix 10.A. Thus, we are able to obtain second-, third-, and 
fourth-order approximants. Our focus here will not be on comparison 
with experimental data (which would require discussion of other 
issues) but on the apparent convergence, or otherwise, of results from 
one order to the next. 


In the real world, quarks have masses. Calculating QCD radia- 
tive corrections including quark masses is much harder, so generally 
quark masses are neglected. This means that one is approximating 
“real QCD” with a set of effective theories, each with a different 
number of massless quarks. The number of “active quarks,” np, 
depends on the physical energy scale Q being considered. Near a 
flavour threshold, at Q = 2m,, the quark mass needs to be allowed 
for, at least kinematically. For Re+e-, the result including masses 
at first order (where R ~ a) is 


Rete- = 3). q? T (vi) (1 + gvi yR) , 


where 


vi = 4/1 — 4m? /Q?, 


T(v) = v(3 — v?)/2, 


w- C) G-a)] 
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(The formula for g(v) is a simplified approximation to the actual, 
rather cumbersome, formula.) At higher orders, we may use the 
above form but with the R calculated to higher orders in the mass- 
less effective theory with the appropriate ns. The Ays parameters 
of the different effective theories then need to be Daed at the 
flavour threshold so that R is continuous. For a fuller discussion of 
the phenomenological issues, see Ref. [15]. 


10.2. Procedure 


We shall compare the optimized results with the conventional 
approach, which is to use the “modified minimal subtraction” 
(MS) prescription with the renormalization scale u chosen equal to 
the centre-of-mass energy, Q. (Properly speaking, then, the RS is 
“MS(u = Q).”) We will essentially presume that the value of Äss 
is known from fitting other experimental data. However, to avoid 
committing to any specific value, we label our examples, not by Q, 
but by the ratio of Q to Ayg. At each order we proceed as if only the 
coefficients to that order had been calculated. 

To obtain the MS results, at a given Q/ Ayes value, the first step is 
to evaluate the numerical value of the T par oe of the MS(u=Q) 
scheme: 


T = bIn(Q/Aqs). (10.5) 


One must then numerically solve the int-3 equation. At second order, 
where B(a) is approximated by 1+ca, that equation is 7 = K® (a), 
with K)(a) given by Eq. (6.23). With the resulting a, one then 
evaluates Re = a(1+r™Sa). At third order one must numerically 
solve 

TS = KÊ (a) = - + eln|ea| — A® (a), (10.6) 
where A(a) is given by Eq. (6.28) wa in this case, B(x) approxi- 
mated by 1+ cx + cx? with cp = gs S. With the resulting a one then 
evaluates RË = = a(1 +r}"a +r}5a?). At fourth order, the procedure 
is tl e same, except that one now includes a c} term in B(x) and an 

S term in R. 
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To obtain the optimized results, one first needs to calculate the 
numerical values of the invariants. At a given Q/ Aue, one finds 7“ 
from Eq. (10.5) and then obtains p,(Q) as 7S — r]. The p2 and 
p3 invariants, which are Q-independent, are similarly obtained by 
evaluating their definitions in Eq. (7.31) using the MS-scheme r; 
and c; coefficients: The p2, p3 results are quoted in Appendix 10.A. 
The optimized result to second order is obtained from Eqs. (8.12) 
and (8.10) with P = 1. At higher orders, one can use the algorithm 
described in the preceding chapter. 

At each order one wants, not only a result for R but also an 
estimate for its likely error. There is no rigorous way of doing this. 
However, it is reasonable to expect that the “apparent convergence” 
of the series (i.e., the behaviour of the terms that have been 
calculated) is some sort of guide. We shall adopt the common practice 
when dealing with asymptotic series of viewing the magnitude of the 
last calculated term, |r,a**+!|, as the error estimate. We do this both 
for the MS and the optimized results. The change in the R results 
from one order to the next — which for optimization is not the same 
thing — provides another indication of the likely error; it seems quite 
consistent with our error estimate. 

We will give two sets of examples; one set at moderately high 
energies, and the other at low energies. For the first set of examples 
the phenomenologically appropriate number of flavours is np = 5 
(u,d,s,c,b quarks), while for the second set it is ns = 2 (u, d quarks 
only). Results for (k + 1)th order (k = 1,2,3) in both the MS and 
optimized schemes are presented in the tables and figures below. 


10.3. High-Energy Examples 


For nf = 5 the 6-function’s leading, RS-invariant coefficients are 


23 29 
b= — = —. 10. 
$ c a3 (10.7) 


In the MS scheme, its next two coefficients are 


ms 9769 
MS _ ~~ _ 1474 10. 
OP = a 74789, (10.8) 
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E 26017 11027 
= = a 
2 31104 1242 


(3 = 9.835916, (10.9) 


where ¢, is the Riemann zeta function. The MS coefficients in 
Ri(ete) are 


r75 = 1.409230, r35 = _12.80463, r35 = —80.43373. 
(10.10) 
(The exact values, involving C2, ¢3, ¢; and ¢7, see Appendix 10.A, 
were used in our calculations.) Inserting these values in Eq. (7.31) 
yields 


p2 = —15.0926, p3 = —33.2216. (10.11) 


Our first two examples are cases that have been discussed previously 
in the literature. 


The history of these two examples is interesting. Back in 1988, 
after publication of a result for r}5, it had seemed that “opti- 


mization” gave very unsatisfactory results; the optimized couplant 
increased from second to third order, the apparent convergence was 
poor, and the error estimate was large; worse than MS. It turned 
out, though, that the original calculation of r¥5 was incorrect. 
When the correct result was published in 1991, the situation was 
transformed. This unfortunate history does at least illustrate the 
fact that the improvement provided by “optimization” is not trivial 


or accidental. The more recent result for r3'° provides further 
confirmation that optimization works as advertised. 


At these energies, the perturbation series seems well behaved. 
The MS results are quite satisfactory and optimization provides only 
a slight improvement. It should be borne in mind, though, that the 
popularity of the MS scheme — over the original minimal-subtraction 
scheme, for instance — was in part due to it giving sensible-looking 
results for R.+-- when u equals the centre-of-mass energy. Applying 
MS to other quantities requires, each time, some new ad hoc guess for 
the appropriate u, whereas optimization is systematic. While opti- 
mization here offers only a small improvement, it is an improvement: 
there is slightly greater precision, with smaller expected errors that 
shrink more rapidly with increasing k. It is also noteworthy that 
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while ays slightly increases with k, the optimized couplant a shrinks, 
consistent with the “induced convergence” scenario. 


Example 1: Q/Ays = 340. 


Table 10.1. Results for R, the QCD corrections to Re+e-, in 
(k+1)th-order (N*LO) at an energy Q/Ays = 340. The upper and lower 
subtables list, respectively, the MS and optimized results. The columns 
give the couplant value, the rough form of the series, and the result for 
R with an error estimate corresponding to |r,a*** |, the magnitude of 
the last term included in the perturbation series. 


0.0381237 1 + 0.05) 0.04017[205] 


0.0382058 .04(1 + 0.05 — 0.02) 0.03955[71] 
0.0382161 1 + 0.054 — 0.019 — 0.004) | 0.03939[17] 


0.0414570 1 — 0.02) 0.04043[108] 
0.0394420 .04(1 — 0.01 + 0.01) 0.03944 
0.0391507 .04(1 + 0.003 + 0.002 + 0.001) | 0.03941 


0.043; - 
0.042: Q/Auys = 340 
0.041 - 

R 0.040 : i 
0.039: n 
0.038 £ 
0.037 * 


Fig. 10.1. Results for R, the QCD corrections to Re+e-, in (k + 1)th-order 
(NELO) at an energy Q/ Ñy = 340. The larger, blue points displaced leftwards 
are in the MS scheme, while the smaller, red points displaced rightwards are the 
optimized results. In both cases, the error bars correspond to |rra"t! |: 
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Example 2: Q/Ays = 68. 


Table 10.2. Results for Q/Ayg = 68. 


[ov Toes [Remsen 


0.0507097 + 0.07) 0.05433 
0.0509032 : + 0.07 — 0.03) 0.05287 
0.0509356 5(1 + 0.07 — 0.03 — 0.01) 0.05236 


0.0568587 1 — 0.03) 0.05496 
0.0525541 .05(1 — 0.02 + 0.02) 0.05256 
0.0520416 .05(1 + 0.002 + 0.003 + 0.002) | 0.05245 


0.058 | Q/Ams = 68 
0.056; 
R 0.054! 
0.052} . + 
0.050 
1 2 3 
k 


Fig. 10.2. Results for Q/Aws = 68. 


10.4. Low-Energy Examples 


Next we turn to lower-energy examples, where the differences 
between MS and OPT become more dramatic. With nz = 2 the 
G-function’s leading coefficients are 

29 115 


= = 10.12 
b=, c= =, (10.12) 
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and 
XB 48241 
2 8352 
ms _ 18799309 n 68881 
~ 902016 12528 


C3 
The R coefficients, in the MS scheme, are 


= 5.77598, (10.13) 


Cs = 27.45054. (10.14) 


riS = 1.755117, r35 = —9.14055, pie = —123.18799. 
(10.15) 
(Again, the exact values were used in our calculations.) Inserting 
these values in Eq. (7.31) yields 


p2 = —9.92498, p3 = —115.21021. (10.16) 


Examples 3-7 give results at successively lower energies; 
Q/Ams = 5,2,1.7,1.5 and 0. One sees in the MS results the char- 
acteristic symptoms of an asymptotic series; after initially seeming 
to converge, the series starts to go bad, with the error estimate 
increasing with order. In Example 3, the effect is just visible in the 
k = 3 result, but it becomes more dramatic in Examples 4 and 5. 
In Example 6, there is no k = 3 MS result at all since there is no 
positive-a solution to the k = 3 int-G equation. At still lower values of 
Q/ Ags, the k = 2 and k = 1 MS int-8 equations have no acceptable 
solution. 

In contrast, the optimized results show a monotonic decrease in 
the expected error at higher orders. The k = 1 results, in Examples 
5 and 6 particularly, are very uncertain at low energies — indeed, for 
Q/Aws < 1.438 (corresponding to p(Q) < 0) there is no solution to 
the k = 1 optimal int-G equation, Eq. (8.12). However, for k = 2 and 
3 the optimized results improve very significantly — and continue 
smoothly down to zero energy. 


A Landau pole is said to occur when the solution for a goes to 
infinity at some finite Q value and becomes negative at smaller Q. 
In MS the Landau pole is at Q = Ags at k = 1, and at Q/Ays = 
1.396 and 1.645 at k = 2,3, respectively. In OPT, because c here 
is positive, there is a Landau pole when k = 1, but it is absent for 
k = 2,3. These matters will be discussed in the next chapter. 
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Example 3: Q/Ays = 5. 


Table 10.3. Results for Q/Aus = 


0.0862557 | 0. a + 0.15) 
0.0902494 | 0.09(1 + 0.16 — 0.07) 
0.0911287 | 0. ae + 0.16 — 0.08 — 0.09) 


0.117285 0.12(1 — 0.09) 
0.0952429 | 0.10(1 — 0.05 + 0.05) 
0.0899359 | 0.09(1 — 0.01 — 0.02 + 0.04) 


Gus =5 


Fig. 10.3. Results for Q/Ams = 
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Example 4: Q/Ays = 2. 


Table 10.4. Results for Q/Aws = 


0.1626471 | 0. a + 0.29) 0.209[46] 
0.1963533 | 0.20(1 + 0.34 — 0.35) 0.195[69] 
0.2193679 | 0. pat + 0.39 — 0.44 — 1.30) | —0.08 + 0.29 


0.3648099 | 0.36(1 — 0.21) 
0.1725913 | 0.17(1 — 0.17 + 0.18) 
0.1421756 | 0.14(1 — 0.08 — 0.09 + 0.17) 


0.4; 
0.3: 


roa f 4 


0.1: 


Fig. 10.4. Results for Q/Aws = 2. (The k = 3 MS result is slightly negative; 
only its error bar is visible.) 
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Example 5: Q/Ays = 1.7. 


Table 10.5. Results for Q/Aws = 1.7. 


0.1966624 | 0. = + 0.35) 0. a 
0.2691684 | 0.27(1 + 0.47 — 0.66) ue 
0.4153849 | 0. Baal + 0.73 — 1.58 — 8.83) | —3.60 +3 


0.6669931 | 0.67(1 — 0.28) 0. ion 
0.1970393 | 0.20(1 — 0.24 + 0.25) 0.199[49] 
0.1530735 | 0.15(1 — 0.11 — 0.11 + 0.22) | 0.153[34] 


[ Q/Auys = 1.7 | 
0.6. S | 


R 0.4; 


ł 
1 2 3 
k 


Fig. 10.5. Results for Q/Äys = 1.7. (Only the tip of the huge error bar for the 
k =3 MS result is visible.) 
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Example 6: Q/Aygs = 1.5. 


Table 10.6. Results for Q/Aws = 1.5. 


0. = + 0.42) 
0.43(1 + 0.76 — 1.70) 


2.5(1 — 0.42) 1.44 1.0 
0.22(1 — 0.31 + 0.33) 0.221[71] 
0.16(1 — 0.13 — 0.13 + 0.26) | 0.161[42] 


Q/Âys = 1.5 


Fig. 10.6. Results for Q/Aws = 1.5. (There is no k = 3 MS result in this case.) 
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Example 7: Q = 0 (fixed point). 


Table 10.7. Results for the infrared fixed-point limit, Q = 0. There 
are no MS results in this case. 


no solution 


0.2635259 | 0.26(1 — 0.76 + 1.01) 0.330267] 
0.1800794 | 0.18(1 — 0.25 — 0.16 + 0.44) | 0.185[79] 


0.8 
06 Q=0 
ROA. 
0.2. : 
uA 1 2 3 
k 


Fig. 10.7. Results for the infrared fixed-point limit, Q = 0. There are no MS 
results in this case, and no k = 1 optimized result. 


10.5. Discussion 


Let us summarize the lessons of these numerical examples, which 
compared the MS(u = Q) and optimized results. At moderately 
high energies, the differences are small — well within the error 
estimates. The main advantage of optimization here is to achieve 
better precision — and, very importantly, to have a systematic 
method that applies to other physical quantities, without the need 
for new ad hoc choices of u in each case. 

At low energies, however, there are more striking differences. 
While the optimized results show steady convergence, the MS results 
begin to exhibit the typical pathologies of a divergent asymptotic 
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series (compare, for example, with Fig. 5.3). One can expect those 
pathologies to show up in the MS results at higher energies when 
the series is taken to high enough order. In this sense, the low- 
energy examples are a “preview” of the divergent-series problems 
to be expected in MS or any fixed RS. 

Whether or not the optimized results in QCD will exhibit 
“induced convergence” is a matter of conjecture at present. The 
examples here certainly show a consistent shrinking of the optimized 
couplant from one order to the next. 

The good convergence of the optimized results remains true even 
in the Q > 0 limit, where the third-order finding of a limit R —> 0.3 
0.3 is confirmed and made more precise; R — 0.2+0.1. This result is 


important because there are many indications from phenomenology 
that the QCD couplant does “freeze” at low energies. Usually freezing 
is something put in by hand, but here it is an outcome, a prediction. 
There is nothing in the optimization approach that forces freezing 
to occur; the fact that it does for Re+e- is due to the p2, p3 values 
resulting from the Feynman-diagram calculations. The next chapter 
will discuss the topic of the infrared limit in more detail and more 
generality. 


Appendix 10.A: G Function and Re+e- Coefficients 
in MS 


The £-function coefficients, bcj, and the Re+e- coefficients, ri, in 
the MS scheme are polynomials in n f, with each fermion loop in a 
diagram being associated with an ny factor. We choose to swap nf 
for b 


o 33 — 2n f 


: 6 


(10A.1) 


both because it makes the results a little more compact and because it 
is convenient when investigating the large-b and small-b (Banks-Zaks) 
approximations. Here ¢, is the Riemann zeta-function. References to 
the original calculations are given in the Bibliography. 
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The -function coefficients, in the MS prescription, are as follows: 


__ (7) 1, 19 
e=- Cr ae 


ms (_ 37117) 1 | 243 , 325, 
a = \ e a 32 7 T92” 
as — (53981 n n (582 _ 16171 
3 1152 3 13824 288 °° 
2587 809 1093 
Z | GS, 
( 96° 14d : 3456 


The Re+e- series coefficients, in the MS(u = Q) scheme, are as 
follows: 


us 1 11 
r= Dt (+ 2a) 6 


— 12521 401 53 25 
MS = — 1 ao en Se a b 
r2 ( -Rs + 3G) + ( 4 38t3 cs) 


(E-e) (BE) E-o) 


ue 3963761 677833 275 
T + 3— zg 
20736 3456 


r3 = 
-r eet Hp Oo + G+ ee - apt 


236089 13859 pi 5a 7 = X 
1728 a 96 3 3 


38969 535 6907 165 9595 665 Je 


175 
M m = a+ tags 


35 
Ei <6 z + T6) P). 


( 
( 
mea m acc. a) 
ACs 
i 
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The p2, p3 invariants for Re+e- are therefore 


_ (12087) 1, (1483 a 
pam 256 / b \288 4° 


937 47, 2 19 i n 
+ — = GS a os) (A a + Hg- 12) b 


Ga) a *) 


(252613 = = ,„ (2217005 , 121613 if a 
P3 = \ “7608 = 10368 864 g 
07913 535, 3619, 535 9295. 665 
+ + ige — > 3 t Saas a $5 = 79 7 
576 
5683 3599 190 
+ (== = So + a 8 — 174¢3 — -y $5 + ne 
665 
(srs E5662 — 3263 156) b3 
oy a 1 175 
(Se ge + T — 6s 
325 245 > 25 
P + 7 8 — 25¢3 + Za) 6), 


For numerical values, neglecting the ` q; terms, see Table 11.1 in 
Chapter 11. 


Note that, while p2,p3,... and c have definite decompositions 
in terms of b, and hence in terms of ny, the same is not true of 
p1(Q), which involves A, whose n f dependence could be anything, 
especially since its specific definition is just a convention. 


One more coefficient of the @ function has been calculated. We 
cannot make use of it for optimization until the r4 coefficient is 
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calculated: 


ws _ (1081830511 | 17251949 , ene 191675 - 
ca 663552 13824 ` P 


1452057293 48015 4489165 T 856625 ‘ 
4 27648 S 2304 ® 


1327104 512 


(83737869 | 16171 176837 176837 , 88415 ; 
-221184 ' 512 “ 2304 °° 2304 

_ (471499 _ 809 39409 i 345 : 
110592 256°! t 9304 63 128% 

(25 _ 19 19, 
18432 2 
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Special Topics 
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Chapter 11 


Infrared Limit: Fixed and Unfixed Points 


11.1. QCD Perturbation Theory in the Infrared? 


The ultraviolet region, for an asymptotically-free theory, is the natural 
domain of perturbation theory: The effective couplant tends to zero 
as Q — oo, so when Q is large enough one can confidently expect 
perturbation theory to give a good approximation. As Q decreases 
the effective couplant increases and perturbation theory becomes less 
accurate and less reliable. Moreover, so-called “higher-twist” terms — 
terms exponentially small in the couplant, and hence suppressed by 
positive powers of A /Q — become ever more important. Thus, it may 
be foolish to consider perturbation theory in the infrared. 

On the other hand, it may not be. While the effective couplant 
grows as Q decreases, it does not necessarily grow without limit, and, 
in some circumstances, it may remain quite small even in the Q > 0 
limit. That, indeed, seems to be the case for Re+e- in QCD, as was 
seen in the numerical examples in the last chapter. 

Of course, one should recognize that the infrared behaviour of 
perturbation theory will not predict the actual infrared behaviour 
of the theory, because of the “higher-twist” terms invisible to 
perturbation theory. Experimentally, physical quantities, such as 
R.+--, show a lot of structure at low energies, related to hadronic 
resonances, whereas the perturbative prediction varies smoothly with 
Q (at least, if quark masses are neglected). Nevertheless, it has long 


This is an open access book chapter published by World Scientific Publish- 
ing. It is distributed under the terms of the Creative Commons Attribution- 
NonCommercial 4.0 (CC BY-NC) License. 


179 


180 Renormalized Perturbation Theory and Its Optimization 


been thought that the perturbative result represents a sort of average 
through the resonant peaks, so that it can be meaningfully compared 
with a smoothed or “smeared” version of the experimental data. Such 
ideas, and the successes of various phenomenological models that 
invoke a “freezing” of the QCD coupling constant, suggest that low- 
energy perturbation theory may have some limited predictive power. 
This chapter will not discuss the phenomenological issues but will 
study the infrared limit of optimized perturbation theory, motivated 
by the above considerations. 


New methods that introduce variational mass parameters for the 
quark and/or gluon fields, analogous to the CK expansion, and then 
optimize these, as well as the RS, offer a very promising approach 
to the infrared region. While it is beyond the scope of this book 
to discuss such methods (see references in the bibliography), the 
lessons learned here in “pure” perturbation theory should be useful 
background for those endeavours. 


One possibility is that there may be no infrared limit at all: 
There may be a “Landau pole” at some finite Q of order A where 
the effective couplant, and hence the perturbative result for R, goes 
to infinity. Such is the case at first order (a qualitative approximation 
only), where the effective couplant is a ~ 1/(bIn(Q/A)). In second 
order, if the coefficient c is positive (as it is for 8 or fewer quark 
flavours in QCD), then a Landau pole also occurs. However, if c is 
negative, then the second-order 3 function, —ba?(1+ ca) vanishes at 
a positive value a = —1/c, and the couplant approaches this value 
as Q > 0. At higher orders, whatever the sign of c, it might happen 
that the 8 function (in the optimized scheme) has a zero at some 
value a = a*, and if a* is quite small one can hope that perturbation 
theory in the infrared is meaningful. 

A simple zero of the 6 function is known as a “fixed point.” There 
is a body of conventional lore about fixed points that we examine 
next, before turning to an analysis of OPT in the infrared limit. 


11.2. “Fixed-Point Lore” and Its Limitations 


The key properties of a renormalizable field theory, according to 
many accounts, follow simply from a graph of its 8 function. 
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Bla) 


—U.V. 


Fig. 11.1. Conventional sketch of “the Ø function” in an asymptotically free 
theory with an infrared fixed point. The couplant flows to zero in the ultraviolet 
and to a” in the infrared. 


Figure 11.1, for instance, supposedly represents an asymptotically 
free theory with an infrared fixed point at a = a*. 

The problem with this lore is that there is no such thing as the 8 
function! The £ function is not a physical quantity; it is not a unique, 
well-defined function characterizing the theory. It is RP dependent, 
and — unlike the case of physical quantities — its RP dependence 
would remain an issue even if we could magically calculate all orders 
of perturbation theory. 


In the literature there are sometimes claims to have calculated 
“the exact 6 function” of some theory. However, since 8 is RP 
dependent, one is always free to define (a) to be any desired 
function, provided that its power-series expansion starts —ba?(1 + 
ca+O(a?)). For example, there is an RP, once used by ’t Hooft, in 
which (a) is defined to be —ba?(1 + ca). Another example of an 
“exact b function” might be —ba?/(1—ca), and there are infinitely 
many other possibilities. Of course, if one could show that some 
particular RP, associated with some particular form of the all- 
orders (@ function, also led to some specific all-orders form for one or 
more physical quantities, then that would be a major achievement. 
However, without that, claims for an “exact @ function” are 
empty. 


The RP dependence of the 8 function is, of course, acknowledged 
in conventional accounts, and the effects of RP transformations 


a’ = a(1+ua+vea? +---), (11.1) 
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with 6 transforming as 


d f 


Ta a 
= — ; 11:2 
Pia) = Z pla) (11.2) 
are discussed. It is observed that if G(a) vanishes at a = a*, then 
b'(a’) will vanish at some corresponding a’ = a’ — pictorially, 


consider a graph of 3(a) and imagine a stretching, or other distortion, 
of the horizontal axis. Thus — it would seem — the statement that 
a certain theory “has a fixed point” is basically an RP-independent 
one. (Another claim, that the slope of the 8 function at a fixed point 
is RP invariant, will be discussed later.) 

Following those remarks about RP dependence, though, most 
accounts then proceed as if “the 6 function” were a well-defined, 
unique function characteristic of the theory — implicitly assuming 
that one can choose a “good” RP once-and-for-all. However, as 
stressed in Chapters 4 and 5, with non-invariant approximations it 
is a mistake to assume that the extraneous variables must have fixed 
values. Just as the renormalization scale, u, should “run” with Q, the 
other aspects of the RS can and should be chosen differently in differ- 
ent cases. Because f(a) changes under optimization, the usual fixed- 
point lore, though undoubtedly valuable, will need re-examination. 

The fact that we are doing perturbation theory and using trun- 
cations of the 8 function is crucial. Under a scheme transformation 
(11.1) the new approximate ( function is not given by Eq. (11.2), 
but by a truncation thereof. That fact spoils the argument that, 
generically, 6’(a’) will have a zero if 6(a) does. Thus, at finite orders, 
fixed points can appear or disappear under RP transformations. The 
existence or non-existence of a fixed point, being RP dependent in 
that sense, is not necessarily a property of the theory but may depend 
on which physical quantity R is being calculated, and to what order. 


In QCD with ns = 2 the MS £ function is —ba?(1 + 1.984 + 
5.78a? + 27.5a° + ---), and has no zero in second, third, or fourth 
orders. Thus, the MS results must go to infinity at some finite Q. 
However, the optimized 8 function for Rete- in the Q > 0 
limit turns out to be —ba?(1 + 1.98a — 22a”) at third order, 
and —ba?(1+ 1.98a — 5.62a? — 1993) at fourth order, giving fixed- 
point values a* = 0.264 and 0.180, respectively. Thus, as was seen in 
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the numerical examples of the last chapter, the optimized results 
for Re+e- continue smoothly down to Q = 0. However, there is 
no guarantee that other physical quantities will necessarily show 
similar behaviour. 


It should not be assumed that this point will become less and 
less important as we go to higher orders of perturbation theory — 
because of the divergence of the fixed-scheme power series for R 
and (a). While the fixed-scheme R results will presumably show 
asymptotic-series behaviour, “settling down” temporarily to a good 
approximation to the true R, the same need not be true of b(a), 
which is not a physical quantity and has no “true value.” In OPT, 
while we can hope that successive approximations to R will show 


’ we should not necessarily expect convergence 


“induced convergence,’ 
of the associated results for the optimized (-function: It need not 
have an infinite-order limit (not even in a re-summed sense) since 
it is not a physical quantity, and inherently depends on the order 
index k. 

The optimized ð function also inherently depends on what 
physical quantity R one is calculating. Thus, it is not necessarily 
true that the physical quantities in a certain theory either all have, 
or all do not have, a finite infrared limit. One should be cautious of 
speaking of the theory having, or not having, a fixed point. 

One should also re-examine the usual lore that “if a theory has 
an infrared fixed point then all physical quantities show the same 
power-law behaviour as Q > 0.” The conventional argument goes as 
follows: Near the fixed point, G(a) is approximately linear: 


bla) xa- a“, (11.3) 


with the constant of proportionality being the slope of the 6 function 
at the fixed point: 


_ dp 


B* = da = . (11.4) 


a* 


For reasons explained below, 6* is claimed to be RS invariant. Thus, 
any RS may be used, and in the EC scheme (assuming for simplicity 
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that R has P = 1) we have, at low Q, 


dR as x 
ITG = Bac(R) ~ B(R — R*). (11.5) 


When integrated, this equation implies a power law 
R* -RX QŽ, (11.6) 


with the critical exponent y* = B* being the same for all physical 
quantities R. (The generalization to any P is straightforward; see 
Exercise 11.1.) 

The argument that 8* is RS invariant goes as follows. Consider 
two RS’s, primed and unprimed, whose couplants are related by 
a general scheme transformation, Eq. (11.1). Formally, their 8 
functions are related by Eq. (11.2) and hence the derivative of the 6 
function will transform as 


a5 dB gË da! 
da’ da a da? / da’ 


(11.7) 


Since (a) vanishes at the fixed point, it would seem that B* is 
the same in both primed and unprimed schemes. Careful authors 
qualify this result with the proviso that da’/da must not vanish and 
d?a'/da? must not be singular at a = a*. Those conditions might 
seem pedantic, but they cannot be entirely disregarded, as Chyla was 
the first to warn. (See Exercise 11.2.) The subtleties at the formal 
level are discussed in Appendix 11.A. 

Because of these subtleties we suggest that, within a given theory, 
there may be different classes of physical quantities: One class might 
have a Landau-pole, precluding any continuation to Q = 0 without 
explicitly including higher-twist, non-perturbative terms. Another 
class might have a finite infrared limit, with a common power-law 
approach to the Q — 0 limit. Other classes might also have finite 
infrared behaviour, but with a qualitatively different approach to 
the Q > 0 limit and/or a markedly different infrared couplant. 

Furthermore, formal results may or may not apply when some 
particular approximation method is used. In OPT at any order 
(except second) it turns out — for non-trivial reasons explained 
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in Appendix 11.B — that the slope of the optimized ( function 
at a fixed point does indeed give the y* exponent governing the 
Q — 0 behaviour of the optimized approximation to R(Q). However, 
it is an open question whether the OPT results for the y*’s of 
all physical quantities are approximately the same. Moreover, as 
discussed later on, in OPT a finite infrared limit can occur at an 
“unfixed point” where 6(a) does not vanish (but OR/Oa does). In 
that case R* — R is not given by a power law, but is proportional to 


1/(InQ)?. 


11.3. Infrared Behaviour in OPT at Second Order 


Before embarking upon a more general analysis of OPT in the 
infrared, we first examine the case of second order (k = 1). Here 
the 6 function is just —ba?(1 + ca). When the invariant coefficient 
c is positive, (a) has no zero: Thus, a Landau pole will occur, with 
a and the R result going to infinity at some finite Q. However, when 
cis negative (which would happen in QCD if ny > 85) the 6 function 
has a zero at a* = —1/c. One might therefore expect typical fixed- 
point behaviour, with a finite infrared limit R* of order a*?. However, 
in OPT things are more subtle because the optimized rı coefficient, 
obtained in Eq. (8.10), is 
P c 

me TERE (11.8) 
which goes to infinity at the fixed point. Thus, the OPT result for 
RO = aP (1 +74), though finite at any finite Q, tends to infinity 
as Q — 0. At low values of Q, of order Ag, where p;(Q) ~ 0 there 
is a plateau, with R®) of order a*?. It is only at ultra-low Q, when 
pı(Q) becomes large and negative, that there is a narrow “spike” 
proportional to —In@ as Q — 0. This behaviour is illustrated in 
Fig. 11.2. 

Of course, the error estimate also diverges as Q —> 0, so the height 
and width of the spike are extremely uncertain. Similar “spiking,” 
for nf Z 8, occurs in higher orders, as we shall see, though there the 
spike does not extend to infinity. 
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2nd order 
n; = 10 


Fig. 11.2. Second-order OPT results for Re+e- with nf = 10 at low energies. 
The energy Q is in units of Ar. The shaded region indicates the error estimate. 
In this case p = 1 and a* = —1/c = 26/37 ~ 0.7. There is a plateau at Q ~ Ar 
where R is roughly a*, but there is a spike, extending to infinity, as Q > 0. 


11.4. Finite Infrared Limits in OPT: Two Mechanisms 


Beyond second order, the optimization procedure involves deter- 
mining optimal values for the higher-order 6-function coefficients, 
C2,C3,..., and these evolve as the energy Q is changed. Thus, the 
optimized 8 function is not a fixed function of a, but itself evolves 
with Q. 

In some cases, this evolution makes little qualitative difference 
and the infrared limit of OPT arises in basically the usual way from 
a simple zero of the (optimized) 6 function. In QCD with ny = 2 
one finds such behaviour for Re+e- in both third and fourth orders, 
as seen in the last numerical example of the previous chapter. The 
fourth-order result, as a function of Q is shown in Fig. 11.3. The 
approach to the Q — 0 limit is characterized by a y* exponent of 
about 3, so that R is nearly constant at low energies. 


When the 7* exponent is greater than 1 it is natural to speak 
of a “freezing” of the effective couplant. When y* is less than one, 
however, it would be more descriptive to speak of a “spiking” of 
the couplant in the infrared limit. 


In some cases, however, the evolution of the optimized 8 function 
with Q plays a crucial role and a finite infrared limit in OPT occurs 
in a quite different way. This was quite unanticipated, and turned up 
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Die 


0.25 4th order 
n; =2 


Fig. 11.3. Fourth-order OPT results for Re+e- for ns = 2. The energy Q is in 
units of Ar. The shaded region indicates the error estimate. This case illustrates 
infrared “freezing” of the couplant, due to a fixed point of the optimized 6 
function. 


in numerical investigations of QCD at higher np values. The infrared 
limit here arises by a “pinch mechanism.” The evolving 6 function 
develops a minimum that, as Q — 0, just touches the axis at ap (the 
“pinch point”), while the infrared limit of the optimized couplant is 
at a larger value, a* (the “unfixed point”). This mechanism produces 
an “extreme” or “logarithmic” spiking of the couplant as Q — 0; see 
Fig. 11.4. 

The next sections examine the fixed-point mechanism and pinch 
mechanism, respectively. 


11.5. Fixed-Point Mechanism 


A finite Q > 0 limit for R(Q) can occur by essentially the familiar 
fixed-point mechanism, with the optimized B(a) function manifesting 
a simple zero at a = a* (see Fig. 11.5). The limiting behaviour can 
be analyzed as follows. For a close to a* one can linearize B(a) as 


B(a) x o(a — a), (11.9) 
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0.5 


4th order 
nf =8 


0.4 


0.1 


Fig. 11.4. As Fig. 11.3 but for nf = 8. The arrow indicates the infrared limit. 
This case illustrates “logarithmic spiking” of the couplant due to the “pinch” 
mechanism discussed in Sec. 11.6. 


0.00 0.05 0.10 0.15 


a 


Fig. 11.5. The evolving optimized B(a) = 8(a)/(—ba’°) function at fourth order 
for ns = 2. The upper, solid curve is the Q = 0 limiting form with a fixed point 
at a* = 0.180844. The two lower curves correspond to larger Q values, and are 
shown dashed when a > a. 
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where o is some positive constant (directly related to the slope of 
the 8 function at its fixed point; 6* = ba*?o). The integrals I;(a) of 
Eq. (7.10) will then diverge in the infrared limit, a > a*: 


a j—2 +j=2 1 
o o*(a* — zx) (a* — a) 


Substituting in B;(a), Eq. (7.9), one finds that the oa) factor is 
cancelled by the (a* — a) factor in B(a), yielding 
Ge) 
B; —, L111 
j0) + ~ (11.11) 
This result corresponds to 
ða |* 


— | = 11.12 
Oc; ? ( ) 


which indeed follows directly by asking how the root a* of the 
equation J`; cka*’ 
Exercise 11.3). The mon parameter o is given by 


= 0 changes as one specific c} is varied (see 


k-1 
o = —B'(a)| = =- jga miey (k-je, (11.43) 
j=0 


where the last step uses the fixed-point condition y a*Î = 0 to 


j=0 © ja 
eliminate c;.. 
With the limiting B;’s from Eq. (11.11) one can construct the 


limiting H;’s: 


1 : 
“a (i-—j—1). 11.14 
>E gali- -1 (11.14) 


Hence, in the fixed-point limit, the formula for the sm = ( Pum) Tmi 


coefficients, Eq. (9.13), becomes (for k > 1) 


m-—1 


sa T Le I(k—-m—j-1)-— So ca (k— m-— j) 


j=0 
(11.15) 
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Writing k — m — j as k — 2m + (m — j) and noting that the (m — j) 
terms will cancel between the two summations, one may simplify the 
result to 


1 


êm = kD [(k — 2m — 1)tm — (k — 2m)Êm-1], (11.16) 
or, equivalently, 
il R 
Sm = k —2m)ém —tm|, 11.1 
5 E-I [( m)ĉ trn] (11.17) 
where 
êm = sna”, m = a”, tm = SG (11.18) 
i=0 


One may substitute Eq. (11.17) for the optimal-scheme r*,’s into 
the expressions for the p; invariants. From the resulting p2 one can 
solve for the optimal-scheme c3 in terms of a*,c, p2. Then, making 
use of that result, one may solve for c3 in terms of a*,c, p2, p3, and 
so on up to cj. Substituting in the fixed-point condition B(a*) = 0 
then produces an equation for a* that involves only the invariants 
C, P2,+--+, Pk- One can then find a* numerically as the smallest positive 
root of that equation. Finally, the expressions for the c*’s in terms 
of a* and the invariants can be substituted back into Eq. (11.17) 
to determine the r*,’s. Hence, one can find R* in terms of invariant 
quantities. 

We now specialize to low orders and to the P = 1 case. 

At third order (k = 2) one finds from Eq. (11.17) 


§, = -(14+ 0), $2 = —2¢2, (11.19) 
and hence 
P (1+ ca*) 2 
S A =I, 11.20 
ry a" 9 3°2 ( ) 
Substituting in the definition of p2 and solving for c3 yields 
eA 3 2) *2 
Cy = —5 (1 + (4p2 — c*)a**). (11.21) 


4a*2 
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Then substituting in the fixed-point condition B(a*) = 1 + ca* + 


cha*? = 0 gives 


7 1 
a ca" +3 (v — ze) a*? =0. (11.22) 


The relevant a* is the smallest positive root of this equation. The 
result for R* at third order can then be expressed as 


7 1 

=a G + zea). (11.23) 

At fourth order (k = 3) one obtains 

1 1 3 
§= — $= aie +€+2@2), 83 = — 563 (11.24) 
and hence 
P 1 P (1+ ca* + 2c3a*?) : iy 

Ti = TIa’ T2 = = p 2 n T3 = Tg (11:25) 


By substituting in the definitions of p2, 3, one can then find c5, c3 
in terms of a* and those invariants: 


_ 11 —4ca* + A8p2a*? 


ch = OE (11.26) 
, d+3ca* + 16p3a*? 


The fixed-point condition can then be expressed entirely in terms of 
invariants as 


8313 3 
39 + rae + 5 p20" a 4p3a** = 0. (11.28) 


The final result for the limiting value of R at fourth order can then 
be simplified to 
249 13 l 
*=a* | —+—ca* + —p2a*" ]. 11.2 
R* =a (tae UETA ) (11.29) 
Results for the Re+- case will be discussed in Sec. 11.7. At third 


order fixed-point behaviour is found at all n p’s, though for low ny the 
error estimates are large. At fourth order a fixed point is found for 
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ny up to 6, and also for nf = 16. However, there is no positive root 
of Eq. (11.28) when ny is 7,...,14, while for ns = 15 a positive root 
exists but it gives a negative a, which is unacceptable (see Exercise 
11.10(iii)). Nevertheless, at those np’s the optimization procedure 
yields results that remain bounded as Q —> 0; it does so, however, by 
a quite different mechanism. 


11.6. Pinch Mechanism 


The essence of the pinch mechanism is illustrated in Fig. 11.6, which 
shows the evolution of the optimized B(a) function in the nf = 8 
case. As Q is lowered the optimized co, c3 coefficients change so that 
B(a) develops a minimum — which, in the limit Q — 0, just touches 
the horizontal axis at a “pinch point,” ap. Although this point is then 
a double root of B(a) = 0, it does not represent a fixed point. The 
infrared-limit of the optimized couplant is not ap but a larger value, 
a*, dubbed the “unfixed point” to stress that it is not a zero of the 
6 function. 


The infrared limits of a or the rm, c; coefficients will be indicated 
by * rather than * when they arise from an unfixed point, not a 
fixed point. 


B(a) 


Fig. 11.6. The evolving optimized B(a) = 6(a)/(—ba”) function at fourth order 
for ng = 8. The curves, from top to bottom, are for descending Q values. They 
are shown dashed for a > a. The lowest curve is the infrared-limiting form, with 
the pinch point at ap = 0.3094 and the unfixed point at a* = 0.432267. 
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One can understand this infrared behaviour analytically as 
follows. B(a) can be approximated around its minimum (at, or nearly 
at, the pinch point a,) by 


B(a) ~ n ((a — ap)? +. 6°), (11.30) 


where ô + 0 as Q —> 0 and ņ is some positive constant. Thus the 
integral for the K (a) function in Eq. (6.22) becomes dominated by a 
“resonant peak”: 


1 
= f i E E (11.31) 
p” 


Therefore, in the Q — 0 limit (where p(Q) = K(a) — rı tends 
to —oo), the ô parameter vanishes x 1/(— ln Q). (Appendix 11.B 
discusses the approach to the Q — 0 limit in more detail.) 
The integrals I;(a) of Eq. (7.10) are also dominated by a huge 
peak in their integrands around ap: 
qi? a T 


~N 


naw | ae a Č 25% 


(11.32) 


One can thus see that the Bj and H; functions, diverge like 1/ 6° 
as ô + 0. Note that the B(a)/a)~+ factor in Eq. (7.9) will involve 
the limiting value of a, which is a* and not ap. Although the B;’s 
and H,’s diverge, the optimized rž, coefficients are finite, because the 
1/8? factors cancel out, as does n, in Eq. (9.13). 


Since the B;’s diverge, the ck optimization equation S = AOR 
Eq. (8.23), means that OR/0a = pa’—'S vanishes at the unfixed 
point a = a*. Thus the second term in the 7 optimization equation: 


OR _ OR| | Ba)OR _ 


@ oie ee 


vanishes in both the fixed-point case, where 3(a) = 0, and in the 
unfixed-point case, where OR /ða = 0. This corresponds to the scale 
dimension of R vanishing when R tends to a constant. (See also 
Exercise 11.4.) 
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Instead of Eq. (11.17) of the fixed-point case, the formula for the 
s*, coefficients can be written as (see Exercise 11.5) 


1 = 
Snap = E-I) > -m—J— 1a, 
j=0 
“i m-1 
= (=) (k —m— j&i], (11.33) 
a 
j=0 
where sm = (2¢™) rm. The pinch point ap is where the limiting 
P P 


form of the B(a) function touches the a-axis (see Fig. 11.6) and 
hence satisfies the two equations 


k 


= Sica, =Q; 


a=ap j=1 


dB* 


k 
* = žal = a 
B*(a) = 2e 0 and 7 


(11.34) 


Using these equations one may write an alternative form of sm 
formula that is more convenient when m is large: 


— (2) Sa - m- j)ġa |. (11.35) 


There are various ways of proceeding to combine the rm’s, the pinch- 
point conditions, and the p; definitions to determine ap, a* in terms of 
invariants. One general strategy is to use Eq. (11.33) for all the r,,’s 
except for rg, for which Eq. (11.35) is simpler. Then, substituting in 
the p2,..., Pr definitions one may solve, successively, for co,..., Cx 
in terms of ap, a* and the invariants. Substituting in the pinch-point 
conditions in the form 

k-1 k 

Sok — j)Chap! =0 and {Do =0 (11.36) 

j=0 j=l 
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(so that the highest power of ap involved is a") gives a pair of 
equations that determine ap and a* in terms of the invariants alone. 

We do not go further with the general case, but now specialize 
to low orders and the P = 1 case. 

At third order the pinch mechanism can occur, but only under 
certain restrictive conditions (which are never satisfied in the et e~ 
QCD case, but for other physical quantities, or other theories, the 
possibility could arise.) At third order the B(a) = 1+ ca + cga? 
function can obviously be rewritten in the form 


B(a) = n ((a — ap)? + 6°), (11.37) 
with 
n =c, —2apn = C, nla? +8) =1. (11.38) 


If 7 = c is positive and c is negative, B(a) has a minimum at a 
positive ap = —c/(2c2) that can become a pinch point if the evolution 
of the optimized c2 coefficient results in 6 tending to zero as Q —> 0. 
From Eqs. (11.33) and (11.35), respectively, one finds 


* 
eo 1+ca oe 2ap 4 
= Sm O. 
3a 


oar (11.39) 


From Eq. (11.38) with 6 > 0 (or, equivalently, from Eq.(11.34)) one 
obtains 
2 e 


ap = E C5 = T (11.40) 


(Using these one may rewrite rj as —3z&.) Substituting in the 


definition of pə yields a quadratic equation for a*: 


4 ae (oa x2 
1 — =ca* +4 ( p2 -— | af =0. (11.41) 
3 2 
The infrared limit of R can be written, using Eq. (11.39), as 


1 
R= gt 3 —ca*). (11.42) 
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As noted above, the pinch mechanism requires c to be negative, 
so Eq. (11.41) will only have a positive root if p2 — Ca is negative. 
Finally, the pinch mechanism requires a* > ap which requires p2/ es 
13/48 (and for smaller p9’s the fixed-point mechanism takes over). 
In summary, the pinch mechanism can operate at third order if and 
only if 


i3 oe í 
c<0 and E < a ae (11.43) 
At fourth order (k = 3) one finds 
* (a* — 2ap) 
Fi == 
4a* dp 
1 
= Aa (1 + ca* + 2cha*ap), (11.44) 
3 Ap 
"3 BGR 


Following the strategy outlined above (but see Exercise 11.6 for an 
alternative method) we use the p2 and p3 expressions to solve for c5 
and c$: 


* * * x2 
$= 3a*? — 4a*ap(1 — 5ca*) + 12a? (1 — 2ca* + 4p2a ) (11.45) 


32a** a2 
—a*® + a**a,(2 — 5ca*) — 4a*a3 (1 — 3ca*) + 4a3 (2 — ca* + 4p3a**) 
4a*? (4a* — 3ap)a2 l 


. (11.46) 


Substituting into 3 + 2cap + cza? = 0 and c+ 2c2ap + 3c3a2 = 0 
(which are equivalent to B'(ap) = B(ap) = 0) yields 


99a** — 4(1 — 21ca*)a*ay + 12(1 — 2ca* + 4pga**)a2 


2=0, (11.47) 


— a*?(1 — 84ca*) + 12(1 — 5ca* + 16p2a**)a*ay 
+ 12(5 + 2ca* — 12p.a** + 16p3a*°)a2 =0. (11.48) 
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Eliminating aa between these equations gives ap in terms of a* as 


a* (62 + 14ca* + (21c? — 148p2)a*? — 6(7p2c — 33p3)a*?) 
2(2 — 31ca* + 3(4p2 — c?)a** + 4(6p2c + p3)a*? + 12(4p2 — Tp3c)a**) 
(11.49) 


ap = 


Substituting back into either equation yields a 6th-order polynomial 
equation that determines a* in terms of the invariants: 


0 = 11680 + 2224ca* + 3(5997c? — 17264p2)a*? 
+ 2(8235c? — 33624p2c + 36976p3)a*? 
+ 18(147c* — 2184p 9c? + 4640p3 + 502p3c)a** 
+ 324(—49poc3 + 152p3c + 161p3c” — 528p3p2)a*” 
+ 108(—147p3c? + 528p3 + 343p3c? — 1386p3p2c + 1089p3)a*®. 
(11.50) 


The final result for the infrared limit of R at fourth order can be 
expressed as 


a*(2a*ap + 12ap? + 3a**(2 + cap)) 


R= 
24ap? 


. (11.51) 
Note that a* > ap is needed for this solution to be relevant. One 
can check that the special case a* = ap is indeed the boundary 
between the pinch mechanism and the fixed-point mechanism, and 
corresponds to where y* = 0. From such an analysis one can 
determine the precise (non-integer) np values where the switchover 
from one mechanism to the other takes place. 


11.7. Numerical Results for Re+e- 


In this section, we present the numerical results from OPT in the 
infrared limit for Re+e- in QCD with ny flavours of massless quarks. 
We consider integer np from 0 up to 16. (At higher n¢’s the theory 
would not be asymptotically free.) The inputs to the numerical 
calculations are collected in Table 11.1, which lists the RS-invariant 
quantities c, p2, p3. These values are obtained from the formulas 
quoted in Appendix 10.A. The Feynman-diagram calculations used 
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Table 11.1. Values of the invariants for Re+e- 
(with > q: = 0) found from the formulas quoted 
in Appendix 10.A, obtained from the Feynman- 
diagram calculations cited in the Bibliography. 


nf c p2 p3 
0 2.31818 —7.066723 —184.37823 
1 2.16129 —8.397865 —147.27522 
2 1.98276 —9.842342 —113.85683 
3 1.77778 —11.417129 —83.83139 
4 1.54 —13.144635 —56.87785 
5 1.26087 —15.055062 —32.63303 
6 0.928571  —17.190118 —10.67155 
7 0.526316 —19.609073 9.52688 
8 0.029412 —22.399086 28.63336 
9 —0.6 —25.693806 47.57023 
10 —1.42308 —29.709122 67.70445 
11 —2.54545 —34.817937 91.25169 
12 —4.16667 —41.724622 122.21944 
13 —6.71429 —51.938541 168.96670 
14 —-11.3 —69.384046 252.90695 
15 —22.0 —108.450422 452.02327 
16 —75.5 —298.641242 1466.56390 


are cited in the Bibliography. In order to simplify the comparison of 
different nf cases — and to avoid having to assign specific electric 
charges to fictitious extra quarks — we have chosen to drop the terms 
proportional to X` qi. Those terms were included in the examples in 
Chapter 10, but they make very little difference to the infrared limit 
in the phenomenologically relevant case of np = 2. 

At third order only the c and p2 information is used. The 
OPT results are given in Table 11.2. The quoted error estimate on 
R corresponds to the last term, rza, of the truncated perturbation 
series, evaluated in the optimized RS. Also listed are values of the 
fixed-point couplant and the exponent y*. The R* results are plotted 
in Fig. 11.7. At low np’s the uncertainty is very large, about 100%, 
but at larger n y’s the results become increasingly precise. 

At fourth order the results are given in Table 11.3. The 
quoted error estimate on R corresponds to the last term, r3a‘, 
of the optimized perturbation series. Also listed are values of the 
fixed-point, or the unfixed-point and pinch-point. The fixed-point 
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Table 11.2. Third-order OPT results for the 
infrared-limit of Re+e- (with X qi = 0) for 
different ny values. The last column gives 
values for the critical exponents y* which 
characterize the power-law approach of R to 
its fixed-point limit; R* — R « or. 


np a* R* y* 
0 0.31328 0.40 + 0.36 4.70 
1 0.28746 0.37 + 0.31 3.89 
2 0.26466 0.33 + 0.27 3.23 
3 0.24422 0.30 + 0.23 2.68 
4 0.22559 0.28 + 0.20 2.21 
5 0.20837 0.25 0.18 1.81 
6 0.19218 0.23 + 0.15 1.47 
7 0.17669 0.21 + 0.13 1.17 
8 0.16160 0.19 + 0.11 0.92 
9 0.14658 0.17 + 0.09 0.70 
10 0.13132 0.15 + 0.07 0.52 
11 0.11542 0.13 + 0.05 0.36 
12 0.09846 0.11 + 0.04 0.23 
13 0.07998 0.086 + 0.025 0.14 
14 0.05954 0.063 + 0.013 0.066 
15 0.03691 0.038 + 0.005 0.022 
16 0.01249 0.0126 + 0.0005 0.002 


pp 
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Fig. 11.7. Infrared limiting values of R.+.-— (with ` q: = 0) in third-order OPT 
as a function of ny. The shaded region indicates the error estimate. 
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Table 11.3. Infrared-limit results for Re+e- (with ` qi = 0) in OPT 
at fourth order for different ns values. For ng = 0,...,6 and nf = 16 
the limit is governed by a fixed point at a*: For ny = 7,...,15 it arises 
from the pinch mechanism, with an “unfixed point” at a* and a “pinch 
point” at ap. (The a* equation has solutions outside this range, giving 
the values in parentheses, but these violate the a* > ap requirement. 
Also, the fixed-point equation has a solution for nf = 15, but one that 
violates the y* > 0 requirement.) The last column gives values for the 
critical exponents y* which characterize the power-law approach of R 
to its fixed-point limit; R* —R « or, In the unfixed-point case, one 
finds instead R* — R x 1/(InQ)? so that, in a sense, y* vanishes. 


nf a* a* Gp R" y* 
0 0.158279 (0.1334) (2.50) 0.164 + 0.083 3.28 
Í 0.168688 (0.1465) (1.20) 0.174 0.083 3.20 
2 0.180844 (0.1633) (0.832) 0.185 + 0.080 3.09 
3 0.195462 (0.1857) (0.651) 0.199 + 0.073 2.94 
4 0.213910 (0.2162) (0.540) 0.214 + 0.059 2.73 
5 0.239369 (0.2588) (0.462) 0.235 + 0.028 2.40 
6 0.282493 (0.3164) (0.402) 0.266 + 0.051 1.76 
7 — 0.383293 0.3525 0.35 + 0.37 0 
8 — 0.432267 0.3094 0.48 + 0.64 0 
9 — 0.429519 0.2702 0.52 + 0.75 0 

10 — 0.376034 0.2341 0.44 + 0.61 0 

11 — 0.301883 0.2001 0.32 + 0.38 0 

12 — 0.229746 0.1673 0.21 + 0.21 0 

13 — 0.166832 0.1346 0:14 £0.11 0 

14 — 0.112784 0.1007 0.08 + 0.05 0 

15 (0.0674) 0.065248 0.0642 0.043 + 0.015 0 

16 0.020058 (0.0215) (0.0228) 0.013 + 0.001 0.001 


mechanism operates for nf < 6.727, then the pinch mechanism takes 
over until nf = 15.191, when the fixed-point mechanism returns and 
operates until ny = 165 when a* > 0. 

Table 11.3 also gives the values of the exponent 7* in the power 
law R* —R x QV that applies at a fixed point. At fourth order it is 
given by 


* = ba*(3 + 2ca* + c9*a*?). (11.52) 


Note that y* is around 2 or 3 for 0 < ny < 6, so the resulting 
low-Q behaviour (Fig. 11.3) is appropriately described as “freezing” 
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Table 11.4. Terms in the optimized -function and œR series in the infrared 
limit (a = a* or a*, as appropriate). 


nyf ca C2 a? c3a° T1a T2 a? rza? 
0 0.36692 0.032328 —1.39925 —0.25 —0.238596 0.524718 
1 0.364583 —0.060272 —1.30431 —0.25 —0.20734 0.489117 
2 0.35857 —0.183906 —1.17466 —0.25 —0.165126 0.440499 
3 0.347489 —0.353982 —0.993507 —0.25 —0.106587 0.372565 
4 0.329421 —0.599622 —0.729799 —0.25 —0.021696 0.273674 
5 0.301813 —0.987899 —0.313913 —0.25 0.112331 0.117717 
6 0.262315 —1.74675 0.484438  —0.25 0.371865 —0.181664 
7 0.201733 —3.98554 2.80955  —0.228168 1.11073 —0.968966 
8 0.012714 —5.89311 5.48146  —0.150668 1.72852 —1.47106 
9 —0.257711 —6.75979 7.37992 —0.102638 2.05663 —1.74115 
10 —0.535126 —6.02116 6.90794 —0.098434 1.8826 —1.61273 
11 —0.768429 —4.50798 5.11621 —0.122888 1.44444 —1.27189 
12 —0.957273 —3.02694 3.3723 —0.156741 0.999204 —0.921032 


13 —1.12016 —1.83192 2.08731 —0.190139 0.635461 —0.631526 
14 —1.27446 —0.907359 1.20999  —0.220064 0.353673 —0.405223 
15 —1.43545 —0.183663 0.619584 —0.245711 0.135041 —0.228425 
16 —1.51438 0.352822 0.161556 —0.25 —0.031878 —0.060584 


of the couplant. However, when y7* is very small one sees instead 
a “spiking” at Q — 0. Thus, the nf = 16 case, although it has 
a fixed point, is qualitatively similar to the “logarithmic spiking,” 
R* —R x 1/(InQ)?, of the unfixed-point case. 

Table 11.4 gives the optimized coefficients, weighted by the 
appropriate power of a, in both the 8 function and R series. This 
information shows the behaviour of the truncated series for both 
R and B(a) — which is, at best, only marginally satisfactory: 
Clearly, by going to the Q — 0 limit we are pushing low-order 
perturbation theory well beyond its comfort zone. Nevertheless, all 
things considered, we believe that the results are credible within the 
large uncertainties quoted in Table 11.3 and illustrated in Figs. 11.3 
and 11.4. In particular, we believe that the dramatic Q — 0 spike 
produced by the pinch mechanism is real; the very large error 
estimate just cautions that the height of the spike is very uncertain; it 
might be somewhat smaller, or it might well be considerably bigger. 
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Fig. 11.8. Infrared limiting values of R,+,— (with X` q = 0) in fourth order OPT 
as a function of ny. The OPT points are shown as blue circles when they arise 
from a fixed point and as red squares when they arise from an “unfixed point.” 
The shaded region indicates the error estimate on R*. (The huge uncertainties for 
7 <ny S12 are because the height of the “spike” is so uncertain; error estimates 
at low, but finite Q values are much more modest.) The smaller orange points 
are the corresponding results in the EC scheme. The fourth-order Banks—Zaks 
expansion about ny = 16.5 is shown by the pink dashed curve. The dotted blue 
curve represents R* = 0.84/b, the asymptotic behaviour of fourth-order OPT for 
large-b. 


Figure 11.8 plots the infrared limiting R values against np. The 
large “bump” around ny ~ 9 is where the pinch mechanism produces 
really dramatic spiking of R as Q — 0, as seen in Fig. 11.4 for nf = 8. 
If, instead of R*, we had plotted R(Q) for some low, but finite Q — 
say around LAR — the bump would not have appeared and the points 
would have been close to the smaller, orange points. Moreover, the 
error estimates would have been much less. 

Those smaller, orange points are the infrared-limiting results in 
the FAC/EC scheme, defined such that all the rm coefficients vanish, 
giving R = dgc(1+0+0+---). The EC 6 function’s coefficients 
then coincide with the pn invariants (and so can be read off from 
Table 11.1). Since those coefficients do not evolve with Q, the infrared 
limit in EC is simply obtained by finding the fixed point of the EC 
6 function. Many authors have observed that, at low orders, the EC 
results are very similar to those of OPT. That observation holds 
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true at low ny and close to ng = 163. It also holds in the range 
7S nf S 18 at energies Q 2 LAR. However, there is a notable 
difference at the very lowest energies: The EC scheme does not see 
the extreme spiking at Q = 0. While it is still true, because the error 
estimates (see Table 11.3) are so large in this region, that OPT and 
EC infrared results agree within the error estimate, it is fair to say 
that the presence or absence of the spike is a qualitative difference 
in the predictions of the two schemes. 


11.8. Discussion 


We now briefly discuss the implications of these results for QCD. 
The abrupt change around ny = 7 (and clearly associated with the 
change of sign of c occurring at nf = 85) seems indicative of a phase 
transition. For n¢ < 6 the phase is presumably the one we are familiar 
with in the real world; colour is confined and chiral symmetry is 
broken, with the associated Goldstone bosons (pions) being massless 
when quark masses are neglected. Vector mesons (p’s, etc.) have 
masses of order A and their resonant contribution dominates et e~ > 
hadrons at low energies. Although the actual Rete- x 1+ R is 
very different from the smooth perturbative prediction, the two agree 
well after Poggio-Quinn—Weinberg smearing is applied to both (see 
Ref. [15] and other references in the Bibliography). 

An interesting possibility is that the low-ny theories can be 
understood through the “large-b approximation.” That approach 
can be viewed as an extrapolation from nf = —oo (remember that 
b = (33 — 2ny)/6 must be positive for asymptotic freedom). We 
shall not discuss the large-b approximation here, since it involves 
highly technical all-orders Feynman-diagram issues. The technical 
subtleties in the infrared region are particularly difficult, but there 
are good reasons to expect that R* œ 1/b, where the constant of 
proportionality is not universal but should, in principle, be calculable 
for any specific physical quantity. The large-b limit of fourth-order 
OPT (see Exercise 11.7) is R* ~ 0.84/b, which is shown by the dotted 
blue line in Fig. 11.8. Intriguingly, it well describes the OPT results 
up to nf = 6. 


204 Renormalized Perturbation Theory and Its Optimization 


For n¢ > 7 the infrared limit is described by an unfixed point. The 
effective low-energy theory here seems to be a renormalizable theory 
with the energy scale Q appearing only in logarithms. The extreme 
spiking of R as Q — 0 (Fig. 11.4), if viewed as a resonant peak in the 
vector channel, hints that massless vector bosons are now present. 
These might be the gluons of an unconfined phase, or they might be 
massless, colourless vector mesons of a confined phase, perhaps with 
unbroken chiral symmetry. This phase presumably persists until n = 
164 (beyond which asymptotic freedom is lost). Although the OPT 
results switch back to a fixed-point limit for nf = 16, there is hardly 
any qualitative difference between the extreme (logarithmic) spiking 
of the unfixed-point case and the very strong (fractional power-law) 
spiking of a fixed-point with a very small y*. Note that the theory 
with 16 flavours (or 16.4999, for that matter) is not exactly scale and 
conformal invariant. Moreover, the phrase “approximately conformal 
invariant” needs to be used with care. While there is a huge range 
of Q over which R is nearly constant (at a value about 0.78 of its 
infrared limit, as will be discussed in Chapter 13), it does fall to zero 
(very slowly) as Q —> œo and it does rise (very abruptly) as Q > 0. 

The approach to nf = 164 is very interesting. Indeed, one 
can make an expansion about that limit, as first noted by Banks 
and Zaks. This Banks—Zaks (BZ) expansion is essentially a small- 
b expansion. We return to discuss it, and to consider its regime as 
a playground to explore some aspects of OPT at arbitrarily high 
orders, in Chapter 13. 

Before concluding this chapter we remark that it is an open 
question whether the fixed-point and pinch mechanisms are the only 
two ways that a finite infrared limit can occur in OPT. Possibly, there 
might be still more exotic possibilities, especially in higher orders, for 
certain ranges of values of the c and p; invariants. 


Appendix 11.A: Effective Exponents and the 
Slope of G(a) 


According to the usual lore, the slope of the 8 function at a fixed 
point, 6*, is scheme invariant. Moreover, it supposedly can be 
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identified with the critical exponent y* that governs the power-law 
approach of any physical quantity R to its fixed-point value: 


(R* —R) x QT (11A.1) 
as Q — 0. These statements are sort-of true, but not quite. 


Indeed, a stark contradiction arises if the slope of the 8 
function at the fixed point, is taken to be a scheme-invariant 
quantity. See Exercise 11.2. 


The formal issue can be resolved by defining an “effective 
exponent” y(Q) associated with a specific physical quantity R. It 
is related to the slope of the 8 function but has an extra term that 
is crucial for its RS invariance. The following discussion will be at 
the formal level. In the next appendix, we comment on issues arising 
from the need to approximate and discuss the approach to Q —> 0 in 
OPT. 

As usual, we consider a dimensionless physical quantity R with 
the perturbation expansion 


R=aF(lt+rjat---). (11A.2) 


Since R is a physical quantity and Q is a physical parameter, the 
successive logarithmic derivatives of R: 
dQ 


for n = 1,2,3,..., with Ry) = R, must be RS-invariant quantities, 


Rin+1| =Q 


(11A.3) 


for any Q. In particular, the combination 


Riz] ËR JAR 


(Q =z =1+9 


is RS invariant. It is the exponent of the local-power-law form of 
R(Q) in the following sense: Take the first three terms of the Taylor 
expansion of R about Q = Qo and fit them to the power-law form 


Rx K+CQ) (11.5) 
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to find 
Ro = Rio-g, = K + CQ}, 
R) = a hs CQ, (11A.6) 
(= a on yy - 1)CQy”. 


These algebraic equations can be inverted to find the three param- 
eters K,C, and y. (Note that K is not Ro in general, though it is 
when Qo — 0, assuming y > 0.) In particular, 


1 


Ro 
yal QoRr , (114.7) 


which is the y(Qo) of Eq. (114.4). 


Note that Ri} /R is Dir), the scale dimension of R, while (Q) 
is (Q/Rj2])dRi2}/dQ, and hence is the scale dimension of Rj). 


At high energies, where R œ (1/InQ)?, one has a negative y, 
but as Q is lowered y becomes positive. As Q — 0 it becomes the 
critical exponent 7* that (for any P) governs the approach of R to 
its fixed-point value R*: 


(R*-R)xQY asQ—0. (11A.8) 


In the perturbative expansion of R, in some specific RS with 
renormalization scale u, the only Q dependence resides in the series 
coefficients r;. For dimensional reasons, these can only depend on Q 
through the ratio Q/y. Thus, we have 


dri aP ti = are OR 
=e E DD ae ce (11A.9) 


i a 


where the u partial derivative is taken holding a constant. The u RG 
equation says that the total u derivative of R vanishes: 
D = OR OR 


0= PTa = YOu ` + Bla) a (114.10) 
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so that one has 
dR OR 
dQ = Bla) 


Since Ry) is itself a physical quantity, we can apply the same 
argument to it to get 


Rp =Q (114.11) 


OR dB OR PR 
Rig = bla) = B(a) (<> -+ aa) (11A.12) 


Dividing Eq. (11A.12) by Eq. (114.11) yields the key result: 
dB PR /OR 
da * 9) Sar! Ba" 


We digress briefly to recall a similar issue for the anomalous 
dimension of a Green’s function or proper vertex, I, which is 


defined as 
uT 1/ a or 
0) = BGT! aa tO Ge : 


(Q) = (11A.13) 


It is not a physical quantity, but a physical quantity, the scale 
dimension of I’, can be defined (echoing the discussion at the end 
of Sec. 2.7) as 


d 
Day = an At p,a(u)) 


K 
T 


K=1 

(This could be written as ga in terms of an overall physical 
scale Q.) The important point here is that the wavefunction- 
renormalization constant Zr) that multiplicatively renormalizes 
T is independent of the momentum arguments q; and cancels 
out in the equation above. By the dimensional argument leading 
to Eq. (11A.9), modified to allow for T having an overall mass 
dimension of D, we see that 


Dig = D -= yr) + 


which is analogous to Eq. (11A.13). 
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Returning to 7(Q), it is instructive to check directly that 
Eq. (114.13) is invariant under scheme transformations. The deriva- 
tives of R transform as 


OR OR jaa! 
ða! ða/ da’ 
PR D (IR je) ja 
ða? ða \ Oa/ da da 
OR OR Ëa sda’ ih 
= | — — — — u; 11A.14 
(Zs Oa T/E) (ay? ( ) 


Hence, the second term in Eq. (11A.13) transforms as 


R JOR ok da! da! 
/ / _ ae 
B@) 558 / gy = Fla ayo Basa / qq (114-15) 
Adding this to Eq. (11.7) we see that 
dp’, ,PR JOR dp PR JƏR 
dg ge aa aa oe a e 


confirming that y(Q) is genuinely scheme independent. 

Further insight into y(Q) is the following observation. Specialize 
to the case P = 1 (or define Rnew = RI : ) and consider the “effective 
charge” (EC) renormalization scheme defined so that R = a(1 +0 + 
0+---). In this scheme 8? R/ða? = 0, so Eq. (11A.13) reduces to 


dprc(R) 
dR ` 


Thus 7(Q), at any Q, is the slope of the EC 8 function at the 
corresponding R. In particular, in the infrared limit, the critical 
exponent 7* is the derivative of the EC £ function at the fixed point. 
Moreover, from Eq. (11A.13), we can say that y* is the deen eave 
of the 8 function at the fixed point in any scheme for which 2 = is 


(Q) = (114.17) 


non-zero and FR is non-singular at a = a*. That includes a large 
class of possible RS’s, but by no means is this “almost all” schemes. 
In general we must go back to Eq. (114.13) and carefully consider its 
infrared limit. For an instance where this subtlety arises see Sec. 13.5. 
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An important open question concerns the “universality,” or 
otherwise, of y*. Is it the same for all perturbative physical quantities 
R? The question hinges on whether the EC couplants a and a’ for 
two different physical quantities R and R’ always have da’ /da|* non- 
zero and d?a’ /da?|* non-singular. Possibly yes, but it may well be 
that physical quantities segregate into distinct classes, each with a 
characteristic value of y*. It is this point — that y* is not necessarily 
the same for all perturbative physical quantities of a given theory — 
that is perhaps the most important lesson to be drawn here. 

The discussion so far has been entirely at the formal level. When 
we approximate R and ((a) a whole set of other issues arises. These 
are discussed in the following appendix. 


Appendix 11.B: Approach to the Q — 0 Limit 


When approximating 7(Q), or its infrared limit y*, the most 
meaningful result is just the original definition, Eq. (114.4), with 
R replaced by its approximation. For some approximation method- 
ologies that is the same as using the formal result, Eq. (11A.13), with 
the R and (a) replaced by their approximations — but that is not 
always the case. 

Let us first consider Rp = QdR/dQ in (k + 1)th order. 
(Henceforth R and 6(a) should be understood as R“+) and 
B+) (a), respectively.) In fixed-RS perturbation theory, where the Q 
dependence resides in the Q/ dependences of the r; coefficients, the 
argument in Eqs. (114.9) holds, except that the series are truncated. 
The next step, using the u RG equation, will therefore not yield 
B(a)OR/Oa, but only the truncated series thereof — which does not 
factorize, and does not vanish when the approximated 6 function 
vanishes. In RG-improved perturbation theory, where ju is set equal 
to Q, the Q dependence is transferred to a(Q), so the factorized form 
B(a)OR/Oa is regained, with each factor being a truncated series. 

In general, if the RS choice evolves with Q then 


dR dR dR d(RS) 
72 TIES ASSA (11B.1) 


i dG lixars IRS) dQ 
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where 


(11B.2) 


In OPT OR/Or and OR/Oc; vanish by the optimization equations. 
Thus, all the complications of the optimized r; and c; coefficients 
evolving with Q can be ignored, and the fixed-RS result, Eq. (11A.9) 
holds. Then, by using the 7 optimization equation in the role of 
Eq. (11A.10), one obtains the result 


I = pla (11B.3) 


where (a) and R have their optimized forms. 

However, when it comes to Ryg the same argument cannot be 
repeated, because the optimal scheme for R is not the optimal scheme 
for Ry}. Thus, y(Q) in OPT is not given by the formal expression, 
in general. At a fixed point, however, the naive expression for y* is 
valid, as we now show. 

First, we note a subtlety that one must beware of. (In the 
following, all scheme-dependent quantities should be understood to 
take their optimized values.) A convenient small quantity is €e = B(a), 
which goes to zero in the fixed-point limit. The difference R* -R will 
turn out to be of order e. However, the optimized a, rm, and c; all 
have eln € corrections as they approach their fixed-point limits. The 
B function (which we write with a dummy argument x, reserving 
“a” for the optimized couplant) evolves as shown in Fig. 11.9. At 
some small Q the B function has a zero at some value az, slightly 
larger than a itself. Clearly, a, — a is proportional to e, the two being 
related by the finite slope of B. However, the B(x) function evolves, 
shifting nearly parallel to itself, so that a* —a,, and hence a* — a are 
of order elne. 

We now show that a, — a, and hence €, is proportional to Q7”, 
where * is indeed B* = ba**o, the slope of the optimized 8 function 
at the fixed point. From p,(Q) = T — rı and the int- equation 
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B(x) 


Fig. 11.9. Sketch of the B function, with dummy argument x, in the optimal RS 
as it evolves in the case of a fixed-point infrared limit. (Compare with Fig. 11.5.) 
Here we are showing a close-up view of the region near the fixed point. At some 
small Q value the optimal couplant is a, and the value of B(x) at x = a is some 
small value e. The optimal B(x) function at this Q is shown in blue, with its 
continuation to x > a shown dashed. It has a zero at az. At Q = 0 the optimal 
B(x) function is shown in red, and the optimal infrared couplant is a*, where B 
vanishes. The subtlety is that while az — a is of order €, both a* — a and a* — az 
are of order celne. The evolving B(x) is shifting nearly parallel to itself, so that 
the slope of B(x) is almost constant throughout the region shown. 


T = K(a), we have 


pi(Q) = K(a) — 171, (11B.4) 
which gives 
~ ” dz 
bIn(Q/Ar) =b aIo (11B.5) 


where “[0]” is shorthand for a lower limit of 0 with the appropriate 
infinite-constant subtraction, as in Eq. (6.17). At sufficiently low Q 
we may approximate B(x) by —9* (az— x). Note that we need a, here, 
and not a*, see Fig. 11.9. Thus, 


b f d 
blinQ = -2 f = const., (11B.6) 
* Jo (az — 2) 
where “const.” is finite as Q — 0. Multiplying through by 7*/b gives 
7*InQ = In(a, — a) + const’, (11B.7) 


and exponentiating gives 


az,-axQ?. (11B.8) 
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Next we note that OR/Oa tends to a constant value — which is 
easily found (see Exercise 11.8), but is not needed. Thus, at low Q 


dR OR , 
—~ = B(a)— x- (a: — -Q%. 11B. 
oO x-(a.—a) «-Q (118.9) 
Integrating this with dQ/Q, we find the expected power law: 
(R* —R) x Q”, (11B.10) 
for Q > 0. Note that the result corresponds to 
dR 
— ~w AŽ (R -R 11B.11 


for R close to R*. 

The case of second order is exceptional (see Exercise 11.8): There 
the Q — 0 behaviour is not governed by the slope of the 6 function 
at a*. The same is true of the unfixed-point case, to which we now 
turn. 

Approaching an unfixed point the natural small parameter is the 
ô of Eq. (11.30). As before, the int-G equation and p,(Q) definition 
give Eq. (11B.5), so, using Eq. (11.31) we have 


T 1 
ô = Tala? AIS 
bajn (— ln Q/AR) 


When we consider Rjg in Eq. (11B.3) it is now G(a) that tends to 
a constant, while OR/Oa vanishes, since it is proportional to 1/Bx, 
and hence to 6°. Thus, as Q > 0, 


as Q > 0. (11B.12) 


dR 1 
— x —>—. 11B.13 
aq * TOP ie 
Integration with respect to ln Q then gives 
1 1 
Re (11B.14) 


~ & (nQ)y?’ 


where the proportionality constant 1/b? can easily be found (see 
Exercise 11.9): 
ba? [(k—1 k 
je R (=) (11B.15) 


T par? a* 
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It involves the constant 7 of Eq. (11.30) which is one-half the second 
derivative of B at its pinch point ap: 


1&B I aa, = 
= 57 = Do 5 —I1)cjah-?,  (11B.16) 
r=Ap j=0 


where we have used B = B’ = 0 at ap to eliminate c and cg_, ina 
similar fashion to Eq. (11.13) for o in the fixed-point case. 

One way to look at the result is to note that Rjg as a function 
of R has the form 


dR 
dQ 
for R close to R*. Thus, it has neither a simple nor a double zero, 
but something in between. An even more intriguing interpretation is 


QZ ~ —2bi(R* — R)?” (11B.17) 


to see the low-energy prediction as 
R=R* -X (1+0(A)) (11B.18) 


with À ~ 1/(—bir ln Q) viewed as the running coupling constant of 
some infrared effective theory whose £ function starts birà?(1+0(A)). 


Exercise 11.1. Show that the formal argument around Eqs. (11.5) 
and (11.6) generalizes to any P, where, in the EC scheme, 


dagc 

dQ ` 

Exercise 11.2. Show that a stark contradiction arises if the slope 
of the 6 function at the fixed point, 


E bi 
da | ja" 


Real, and beclarc)=Q 


is taken to be a scheme-invariant quantity. Write the 6 function as 
B(a) = —ba? 5 ca’, 
i 


with co = 1 and cı = c, and the 7 sum extends to infinity for this 
formal exercise. If 6* = —b X; ict a*'t! were a physical quantity 
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then we would have 


ap da* dB* _ ‘ 
Oc; . Oc} da* 


Using 0a*/Oc} = a*)/o (see Exercise 11.3), show that this would 
reduce to 


§- iG + Daa" / Y icia" =0. 


But this equation would have to be true for all 7 = 2,3,..., which 
is clearly impossible since the second term is independent of 7. 


Exercise 11.3. Show that the result 0a/0c;|* = a*’/o, 
Eq. (11.12), obtained by considering the J;(a) integrals as a > a*, 
is the same as 0a*/0c; obtained from z (1+ca* +.+-teta**) = 0. 


Exercise 11.4. The form of the optimization equations in the 
fixed-point and unfixed-point infrared limits is worth considering. 
(Because we used the formula for the sm coefficients — which solves 
those equations — we did not need this discussion earlier.) A simple 
argument is that, if the couplant a tends to a constant value a* (or 
a*) in the infrared limit then the whole OR /ðr should be the same 
as the partial variation with a held constant: 


Thus the 7 optimization equation reduces to this term set equal to 
zero. This is indeed true since the other term 8 (a) 2 vanishes: in 
the fixed-point case 3(a) vanishes, while in the unfixed-point case 
ok vanishes. However, this simple argument is not valid for the 
other optimization equations. Why not? 
In the fixed-point case define a new RS variable 
ch = Co + c30* +--+ + aa, 

and change variables from {7,c¢2,¢3,...,ckK} to {7,¢b,¢3,..., ck}. 
Now the RS variations with respect to cj for j > 3 are at constant 
ch. Show that the simple argument is valid for the new c; equations, 
though not for the c% equation. Show that this result corresponds 
to the fact that 8;(a) — a*’~?o(a) vanishes at a = a*. 
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Verify that a similar result holds in the unfixed-point case holds 
if we define 


i k-2 
Co = C2 + C3Ap + +++ + CKap A 


in that case. 


Exercise 11.5. Derive Eq. (11.33), the formula for the sm 
coefficients in the pinch-mechanism case. First, from the form of 
the I; integrals in Eq. (11.32), find the asymptotic forms of the B,, 
and hence the H; functions, remembering that the infrared limit of 
a is a*, which is distinct from the pinch point ap. Then substitute 
in the general formula, Eq. (9.13), and simplify. 


Exercise 11.6. An alternative method for treating the pinch 
mechanism in fourth order is as follows. First, use the two 
conditions, Eq. (11.34), satisfied by the pinch point a, to show 
that 


K (3 + 2cap) 
ġ=-— 5 i 
ap 
* (2 + Cap) 
C3 = 3 
ay 


Then substitute these and Eq. (11.44) into the definitions of the 
p2 and p3 invariants to obtain two equations for ap and a*: The 
first is Eq. (11.47) and the second is a linear combination of it 
and Eq. (11.48). (Direct manipulation of these equations leads to 
a result for ap that is more cumbersome than Eq. (11.49).) 


Exercise 11.7. Using the results for the p invariants for Re+e- 
quoted in Appendix 10.5, find the large-b (nf —> —oo) limit of the 
fourth-order OPT formulas, Eqs. (11.28), (11.29), and show that 
these yield Rž,- ~ 0.84/b. Show also that y* tends to a finite 
limit of about 2.76. (Both these results extrapolate quite well as 
far as nf ~ 6.) 


Exercise 11.8. Near a fixed point, as discussed in Appendix 11.B, 
OR/Oa tends to a constant value. Show that, in (k + 1)th order of 
OPT (for k > 2) 


OR)" pP 
dal (k—1)’ 
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and that 
xP 
Pa 
R* = R ~v ——e. 
(k — 1) 


The case of second order (k = 1) is exceptional. As discussed in 
Sec. 11.3, for negative c the 6 function has a zero at a* = —1/c, 
but the optimal rı coefficient and R diverge as Q —> 0. From Eq. 
(11B.3) show that 


ge = —pbha” t! 


dQ , 
and hence show that 
R ~ pba*?*! (—InQ) 
as Q > 0. 


Exercise 11.9. Find the constant of proportionality in 
Eq. (11B.13) and hence obtain the result for bir in Eq. (11B.15). 
Show that, for the P = 1 case at fourth order (k = 3) it reduces to 


dp \7b 
bir = 2ap(3-+ cay) ( 2) =, 


T 


Also show that, approaching an unfixed-point infrared limit, 


par? ey 
R* = R ~ ——7| — 82. 
(k—1) \ap 


Exercise 11.10. Consider third order (k = 2). The nature of the 
infrared limit depends on the two invariants c and p2. (The former 
depends only on the theory, while the latter depends on the specific 
physical quantity being considered.) The object of this exercise is 
to map out, in the c, p2 plane, the regions of different infrared 
behaviour. 


(i) First, consider the EC scheme. The fixed point, if one exists, 
is the smaller positive root of 1 + ca + c2a? = 0, with c2 = po. 
Show that contours of fixed a* are straight lines in the c, p2 
plane, whose envelope gives the boundary of the fixed-point 
region. That boundary is the positive c-axis (p2 = 0) and the 
half-parabola pə = ie? for c < 0. Above this boundary there 
is Landau-pole behaviour. 


(ii) 


(iii) 
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Repeat the analysis for OPT. It is somewhat simpler to 
consider the c,p plane, where ph = p2 — tc’, since then 
the contours of fixed a*, given by Eq. (11.22), remain straight 
lines. Show that the envelope of these straight lines, for c < 0, 
is ph = fc’. 

However, for a viable fixed-point solution a* must not only 
be the smallest positive root of the a* equation, it must be 
the smallest positive root of B*(x) = 1 + cx + cha?. At this 
order, with only two roots, it suffices to check that the slope 
parameter g is positive. Show that this requires a* < ay 


and hence ph < RC. The pinch mechanism operates for 


ROC <ph < tc for negative c, see Eq. (11.43). There the 
contours of fixed a* are given by Eq. (11.41). 
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Chapter 12 


Optimization of Factorized Quantities 


12.1. Introduction 


Factorized quantities have one non-perturbative factor and another 
that is perturbatively calculable. The factorization itself requires 
arbitrary choices, which give rise to a “factorization-scheme- 
dependence problem” in addition to, and intertwined with, the 
renormalization-scheme-dependence problem. The difficulties ini- 
tially appear formidable and have only recently been clarified, but 
when the smoke clears the resulting optimization procedure is only 
slightly more complicated than for purely perturbative quantities. 
The prototypical case arises in deep-inelastic leptoproduction, 
where a high-energy lepton collides with a proton, or other hadron, 
exchanging a virtual photon of large virtuality Q?. Neglecting power- 
suppressed terms, the nth moment, i; degn F (z,Q), of the non- 
singlet proton structure function can be factorized into the form 


Fn (Q) = (On(M))}Cn(Q, M), (12.1) 


where (O,,(M)) is an operator matrix element, Cn is a coefficient 
function, and M is some arbitrary “factorization scale.” (From now 
on the moment index n will be suppressed.) 


This is an open access book chapter published by World Scientific Publish- 
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The operator matrix element (O(M/)) has an M dependence 
given by its anomalous dimension 


—-— = yo. (12.2) 


While (O(M)) itself cannot be calculated perturbatively, its anoma- 
lous dimension, yo, has a calculable perturbation series of the form 


yola) = —bga(1+ gia + ga? +- i (12.3) 


The leading-order coefficient is written as —bg for later convenience. 
While g is invariant, the other coefficients, gi, g2,... are scheme- 
dependent. The expansion parameter here, a = a( M), is the couplant 
in some arbitrary RS whose renormalization scale is M. 

The coefficient function C can be calculated as a perturbation 


series: 

C(Q,M) =1+na+roat+---, (12.4) 
where @ is the couplant of some other arbitrary RS — which can 
be different from the RS used to define a. It can have a different 
renormalization scale M, and different RS labels ¢2,.... Perhaps the 


easiest way to understand that the RS’s for a and à can be distinct, 
without inconsistency, is to imagine that first both (O) and C are 
calculated in the same RS and then a substitution & = a(1 + via + 
vga? +--+), with arbitrary v,,v2,..., is made in the result for C. 
In terms of renormalization constants, the Zo constant needed for 
the renormalization of the operator O (which is genuinely an infinite 
change of normalization) must be consistent between the calculations 
of C and yo, but the reparametrization step — the substitution of 
a = Zadpare and G = Z,dpare in the bare forms of Yo and C, respec- 
tively — can involve distinct Za and Z, renormalization constants. 

Thus, what we shall call “RS/FS dependence” involves a choice 
of factorization scheme (FS), parametrized by gj,92,..., and two, 
independent, choices of RS for a and ã that are labelled, respectively, 
by T, C2,c3,... and by 7, C2, é3,..., where 


7 =bln(M/A), 7=bln(M/A). (12.5) 
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In this chapter we shall omit the tildes over A, used previously 
to distinguish our definition from the more conventional one. Note 
that, without loss of generality, we may assume that the two 
RP’s, for a and à, are defined so that their A parameters are the 
same: If this were not the case initially, we could trivially redefine 
the renormalization-scale parameter in one of the schemes by a 
compensating factor. 


12.2. The Form of (©) and the Invariance of Its 
Normalization Constant 
Integrating Eq. (12.2), utilizing the $-function equation 
ye = B(a) = —ba? (1 + ca + ca +-->) (12.6) 
au ~ la) = ta ca + ca ‘ : 


gives 


(Gi = tentisk ees ( f i inti). (12.7) 


(Note that the M dependence of (O) comes solely from the M 


dependence of a.) The constant of integration may be written as 
a constant A defined by 


(O) = Aexp (f° arte) r [ uae), (12.8) 


where, as with the definition of A, the lower limit of x — 0 in each 
integral produces a divergence that cancels between the two integrals. 


The normalization constant A is not calculable from perturbation 
theory, but is RS/FS invariant, as we now show. 


Theorem (Politzer and Stevenson). The normalization constant 
A in Eq. (12.8) is RS/FS invariant. 


Proof. The proof is directly analogous to the proof of the 
Celmaster—Gonsalves (CG) relation (see Sec. 6.5). Let primed and 
unprimed quantities refer to two different schemes (both with the 
same value M for their scale argument). Then we have 


a=a' (1+va'+---), 


(12.9) 
(0) = (O') (1+ wia' +++), 
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where the coefficients v1, w1,... can have arbitrary values. Taking 
the logarithm of Eq. (12.8), after dividing through by A, and then 
subtracting the corresponding equation in the primed scheme, gives 


In (2) —In (E) = [ da Lo 7 a gO, (12.10) 
The left-hand side is 


n() +m (Sr) =1n(5) +In(1+ wia +---) 


=In (=) + O(a), (12.11) 


and the right-hand side of Eq. (12.10) is easily shown to be of order a. 
(The key difference from the A case is that the yo factors make the 


integrands less singular at the x = 0 endpoint; 4 rather than +.) 
As in the CG argument, we may now consider the limit M — oo, if 
b > 0 (or M > 0 if b < 0) so that all terms of order a tend to zero. 


Thus, we see that 
A’ 
la 3) 12.12 


so that A’ = A. Thus A is the same in all schemes. As with the 
CG argument, the M — oo limit is merely a convenient trick; 
the unwanted terms actually cancel for any M. One can show this 
explicitly by using the method of Osborn’s proof of the CG relation 


(see Exercise 12.1). 


The notation conceals the fact that A depends on the moment 
index n and on the specific hadron whose structure function one 
is considering. For any given hadron the set of A,’s provides a 
scheme-independent characterization of the hadronic wavefunction 
information. The A,’s can be fitted to the data for one experiment 
(say, deep-inelastic leptoproduction) and used to make predictions 
for another process (say, Drell-Yan). Given the A,,’s one can use 
perturbation theory to predict the structure-function moments — 
or, inversely to find the A,’s from experimental structure-function 
data. One will need perturbative calculations, for each n, of the r; 
coefficients in C and the g; coefficients in yo, as well as the 6-function 
coefficients. 
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12.3. Second-Order Approximation 


We first discuss the second-order approximation, which corresponds 
to truncating the series (12.3), (12.4), and the -function after two 
terms. The integrals in Eq. (12.8) become 


[ dz —bgx(1 + gız) -f a gx 
br? (1+cr) Jo x?(1 + cx) 
a 1 i 1 c 
=o | de-s f a(1-) 


=g (Z In(1 + ca) + ln |ca| — In(1 + ca)), (12.13) 


which exponentiates to 
|cal9(1 + ca) 7907919, (12.14) 


Substituting in Eq. (12.1), one obtains the second-order approxima- 
tion to F as 


FO) = Alcal9(1 + ca) 90/9 (1 + ry). (12.15) 


This approximant depends on RS/FS choices through three variables, 
T, 7, and gı. Partial differentiations of Eq. (12.15) yield 


1 OFC 1 7 7 _ Or 

ae oe ana) (-2a + ca)r1 +4 =). (12.16) 
1 oF? a Ory 

FQ) Ər —ga(1 + gia) ar E Dr? (12.17) 
1 ore g a Or, 

Fe =~ Intl —— =. 12.1 

FO Og, ; n(1 + ca) + (i+ na) Dar (12.18) 


Self-consistency of perturbation theory requires these variations to 
be of order a?. Noting that @ = a(1 + O(a)), we see that 


“=p, eg. ee (12.19) 
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so that rı has the form 


rı =9(F — 91 — 01(Q)), (12.20) 


where o(Q) is an invariant. 

Substituting Eq. (12.19) back into Eqs. (12.16)—(12.18) and 
equating to zero produces the optimization conditions. Since Or; /OT 
vanishes, the solution to the optimization equation (12.16) is simply 


re = 0, (12.21) 
The second optimization equation, from (12.17), then reduces to 
a = a(l + gia), (12.22) 
and (12.18) gives 
In(1 + ca) = ca. (12.23) 


Eliminating à between these last two equations gives us the optimal 
gı in terms of a: 


grt = In(1 + ca) — ca 


= 12.24 
1 ca? ( ) 
Also, from the int-@ equation at second order, we have 
1 
r=24 ch =]: (12.25) 
a 1 + ca 


Substituting for 7 and for gı in Eq. (12.20) and equating to zero, 
since ry = 0, we find 


In(1 — (ca)? 1 
n(1 + ca) — (ca)*In a 


| = ca (2 — acı (Q)), (12.26) 


which determines the optimized a in terms of the invariant quantities 
c and o1(Q). Substituting back in Eq. (12.24) then fixes g?™*. The 
final optimized result, from Eq. (12.15), is 


Fo) = = Alca|9(1 + ca) 79079 "o, (12.27) 


Note that the optimization condition roPt = 0 means that Copt = 1, 
so that all perturbative corrections are effectively exponentiated 
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and reabsorbed into the anomalous dimension by the optimization 
procedure. As we shall see later, this property holds at any order, as 
first noted by Nakkagawa and Niégawa. 

Also note that while the value of à (and hence 7) is determined, 
it is not needed to obtain the result for P 


12.4. RG Equations 


As discussed above the RS/FS variables are T, cj, 7, Čj, and the 
gi coefficients. We now write down the RG equations expressing the 
fact that the physical quantity F is independent of all these variables. 
Symbolically, we have 


1 OF 

Fox =o (12.28) 

where X stands for any of the set of variables {7, cj,7,¢;, gj}. 
Recalling the factorized form F = (O)C of Eq. (12.1), and noting 


that (O) is manifestly independent of M, we see that 


10F 10C 
—— = -—. 12.29 
FOr COFr ( ) 
The same argument applies to the ¢; derivatives, since (O), while it 
depends on a and its RS variables 7, cj, is manifestly independent of 
a and its RS variables 7,¢;. Thus, the first two RG equations have 


the familiar form 


ð b(a) ð = Ea ” 

(Sl, 7 weno j=l (12.30) 
ð zoan 0 , 

(se + Bila) gz) C =0, ee (12.31) 


where the first term collects dependence from the r; coefficients of C, 
while the second term collects the compensating dependence via a. 
The other RG equations all take the form 
10C 1 OO) 


oor or (12.32) 
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where X is any of the variables 7, cj or gj. The first term only involves 
dependence via the r; coefficients — indeed we are tempted to add 
“a” (meaning “with @ held constant”) to the notation, to match 
Eqs. (12.30), (12.31), but it is unnecessary since @ is manifestly 
independent of 7,c; and gj. The second term can be evaluated as 
follows. In the case X — 7, we may simply use the definition of yo, 
Eq. (12.2), to get 


1 OO) _ 70 
— = —. 12. 
O ar 8B ee 
For X — c; we can first write 
1, O10) 1 OO) 1 d(O) ða 
S = — — — 12.34 
(0) 8c; ~ (0) dq | O da Oe,’ eee, 
and then use Eq. (12.8) to obtain 
1 OO) a YolT), j+2 , Yola) 
— =] d brit? + (a). 12.35 
Oy be; Io BP taa 
Although we return to this form later, for the present we rewrite it as 
1 a0) f * Pile), 
— = dx T); 12.36 


where yh (x) = dyo/dx. The equivalence to Eq. (12.35) can be shown 
by integrating by parts and then using the differential equation 
satisfied by the 6; functions, Eq. (7.12). (See Exercise 12.2.) Finally, 
for X — gj we find, from Eq. (12.8), 


1 OO) | a gitt 


Thus, the RG equations, in addition to Eqs. (12.30), (12.31), are 


Ee eG er (12.38) 
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Í OC a B;(x) ; 

kauan = = 2 ae 12. 

an +f deol), =0, j=2B3. (1239) 
1 3C 1 gi th 
— eh dx —~ = 0, el ee 12.40 
Cag 2h BR) ee) 


As usual, the RG equations determine how the coefficients r; 
must depend on the RS/FS variables. We now rewrite the RG 
equations to facilitate finding these dependences. First, we use the 
series for yo and C: 


yola) =—bg giat, C=) ria’, (12.41) 
i=0 i=0 


with ro = go = 1. Second, we convert the 6,6; functions to the 
B, B; functions of Sec. 7.2 (whose series begin 1 +.---). A third 
simplification, concerning the lower limit of the 7 summations, is 
discussed below. We obtain 


or — @B(a) Sina’! =0, (12.42) 
i=1 T i=1 
De 4. BAG 
> spat ai BO) Sina =0, (12.43) 
i=j+1 G J i=1 
= ——a — ga X gia = 0, (12.44) 
i i= A i=0 
IS Org. g -1 Bj (2) 
—— | dex * 1)gizt =0, (12.45 
C1065" jh * Bley 24+ Leo ea 
1 ori 6 a gi! 
= dx—— =0. 12.46 
Tog" +9 f eE pone) 


The i summations of the Or;/OX terms inherently begin with i = 1, 
but in the c; and g; equations, where the second term starts only at 
order a/, it is immediately evident that r; cannot depend on Cj OF 
gj for i < j. Thus, we may begin those i summations at i = j. For 
the čj equation a stronger result holds, since Or;/O¢; must vanish 
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for i = j as well as for i < j. This observation is crucial for the 
“exponentiation theorem” proved in Sec. 12.6. 

In (k + 1)th order all the sums would go up to i = k only and 
the equations would be satisfied, in an arbitrary RS/FS, only up to 
remainder terms of order a*+!. The vanishing of all terms up to and 
including a* fixes the RS/FS dependence of the r; coefficients, and 
leads us to identify a set of invariants, gj, as discussed in the next 
section. 


12.5. Invariants 


The scheme dependences of rı were already found in Eq. (12.19) and 


led us to the first invariant 
ry 


oi(Q)=T -g1 - = (12.47) 


It is Q dependent because rı, when calculated from Feynman 
diagrams, will contain a term —bgIn(Q/M). As in the discussion 
of Sec. 7.3, one can write o1(Q) as bln(Q/Ar), where Ap is a scale 
specific to the quantity F, but related in an exactly calculable way 
to the A of some universal, reference RS. 


The earlier literature used an “invariant” «; given by 
kı =11+g9gi + bgIn(Q/M). 


It is true that «1 is invariant under changes of FS and renormal- 
ization scale, with the explicit gı and M dependences cancelling 
the implicit gı and M dependences of rı. Where «kı fails to be 
invariant is under a change of RS/FS that leaves g; and the 
renormalization scale M unchanged, but changes the RP, so that 
a’ = a(l + via +---), with some arbitrary vı. Under such a 
transformation the af factor in (O), see Eq. (12.14), becomes 
(a)! = a9(1 + guia +---), so the coefficient rı must become 
r} = rı — gv, to leave F = (O)C invariant. Thus, « = «1 — gv1. 
Since our 01(Q) is 


a1(Q) = bIn(Q/A) — k1/9, 


this change in «ı cancels with the change from A to A’, by the 
Celmaster—Gonsalves relation. 
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The higher invariants, o2,03,..., can be defined to be Q- 
independent. As with the p; invariants, it is convenient to define the 
o;’s so that they reduce to the -function coefficients cj in “effective 
charge” schemes, defined by the RS/FS choices g; = 0, r; = 0. The 
invariants, so defined, depend on 7 and 7 only via the difference T —7 
and have no dependence on Q or A. 

To find the invariants we will need the conversion between à and 
a; either @ = a(1 + Via + Vea? + ---) or its inverse 


a=a(1+Vja+ Vea" +---). (12.48) 


As discussed in Exercise 7.4 in Chapter 7, the V; or V; coefficients 
can most easily be found from the relation between the 8 functions: 


B(a) = (da/da)8(a). (In fact, the calculation mirrors that for the p; 
invariants.) The first three coefficients are 


Vir, 


Va =F —1) + (#7) - (@ - 0), 


Vs = G—7) + Žel? — T)? +96 + 3c2)(ř — T) — (6s — c3). 
(12.49) 


Note that the V;’s do not only involve differences cj — Čj. It is 
true, though, that the V; coefficients of the inverse relationship are 
obtained by exchanging all plain and tilde variables. 

We now turn to a calculation of the invariant o2. Expanding 
Eqs. (12.42)-(12.46) in powers of a and and using the above 
relations we can extract the self-consistency conditions. From the 
lowest-order terms we recover Eqs. (12.19) for r1°’s derivatives, plus 
confirmation that rı does not depend on the other RS/FS variables 
(c2, €2, or g2). From the next-order terms we find 


o O 7 

S Srp S aol ++? -7), 

Ore pin Ore g 

0, E. (12.50) 
Ore Ore g 


Cs 
ogi Gea es z= =. 
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Integrating each of these equations individually is easy, but com- 
bining the results consistently is a little tricky. However, it is 
straightforward to check our result that rə has the form: 


_l 2 2, Tt ~ 
T2=5 “Set amie oo; age Agung E T) 


+ const., (12.51) 


where the constant is independent of all the RS/FS variables. The 
constant can be conveniently written as $09 so that the invariant o 


is given by 


2 2r2 ri r? z 
02 = C2+g2—g1e— 91 + —— aot gts) —(7-T). (12.52) 


This reduces to c2 in the “effective charge” scheme mentioned earlier. 

An easier and more systematic way to calculate the o; invariants 
is to find them as the p; invariants associated with the physical 
quantity (the scale dimension of F’) 


pe. (12.53) 


The perturbation series for D can be found in terms of the C and yo 
series in various ways. Perhaps the simplest is the following. First, 
note that all the Q dependence resides in the coefficients of C. For 
dimensional reasons such Q dependence can come only via the ratios 


Q/M and Q/M. Thus, 
Q dC 1 dC ~ OC 
24 =M M—}| |}. 12.54 
C dQ = ( dM OM ) ae 


The M dependence of C must cancel out with that of (O) in the 
product F = (O)C, so that 


MdC M dO) 


te ee 12. 
CdM (OaM i 
while C is independent of M, so that 
~ aC ~ OC =, OC 
7 ant |, B(a) 5 (12.56) 
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From these observations we see that 


B(@) aC 


D= —. 12.57 
Yo+ C Oa ( ) 
Thus, D is, in a sense, a “physicalized” version of Yo. 
Substituting in the above formula we find 
D = —bga(1+ gia + goa? +---) 
Ipo ohr 
Horer j A fee 


(L+riã+.---) 


We could now expand out in terms of à, converting a to à using 
Eq. (12.48). Alternatively, we can eliminate & and find the series 
expansion in terms of a. The results are more compact in the a 
scheme: 


D = —bga(1 +rPa +r?a? +---), (12.59) 

with 
r? = gı + 11/9, (12.60) 
r? = g2 + - (2ra ter, — r? — 2ri(F T)), (12.61) 


and so on. Note that these coefficients are independent of the FS and 
independent of the tilde RS variables, with the explicit g; and T, Čj 
dependences exactly cancelling with the implicit dependences from 
the r; coefficients; see Eqs. (12.19), (12.50). Thus, the r? coefficients 
only depend, in the usual way, on the RS variables T,c; associated 
with a. 

As usual, we can construct the p; invariants for the quantity D: 


pr (Q) =F, (12.62) 

py =at =e =N, (12.63) 

and these coincide with the o’s. Indeed, it is easy to see that 
the “effective-charge-type” RS/FS used in the definition of the 


o’s corresponds to the usual effective-charge scheme for D, so the 
equivalence of p? to gj is true for all 7. 
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The calculation can be straightforwardly extended to higher 
orders. Defining 


A=7-7r=bIn(M/M), s= Ż, (12.64) 


the first three invariants are 


o31(Q) =T-g — 51, (12.65) 
02 = C2 + 92 — ge — gi + 282 — 29151 
7 +g) — 251A, (12.66) 
o3 = c3 + cg] + 49? — 6g192 + 293 — 2c291 


+ 6č2s1 — 4c9g151 + 129751 


69281 5cs? 2cgs? H 12918? 


+ 6gg1s? + 4s? + 6gs? + 29°53 


6c281 + 4cs2 — 1291829 — 125152 — 698152 + 653 


+ (129181 — 10cs, + 12s? + 6gs? — 12s2)A + 68, A?. 
(12.67) 


Using these formulas the values of the invariants can be found from 
Feynman-diagram calculations performed in any convenient RS/FS. 


12.6. The Exponentiation Theorem 


The (k+ 1)th-order approximation is defined by truncating the series 
for C, yo, and B. The resulting approximant, in general, will have 
a residual RS/FS dependence that is formally of order a*t+!. The 
optimization conditions correspond to requiring the RG equations to 
be exactly satisfied, with no remainder. (To avoid notational clutter, 
we leave it understood that, henceforth, any RS/FS-dependent 
symbol (a, &,r;, etc.) stands for the optimized value of that quantity.) 

At second order we saw that the 7 optimization equation gave 
rı = 0. In third order (k = 2) the 7 equation (12.42), in which 
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Or2/OF = rı, reduces to 
(1 + ca + č&2ã°) (rı + 2r2ã) — rı = 0. (12.68) 


Also, the č equation (12.43), in which the B2(ã) factor cancels out 
because Or2/0¢2 = 0, becomes just 


r+ 2roa = 0. (12.69) 


Substituting this back into the previous equation gives ry = 0. 
Substituting that result back into the second equation then gives 
rg = 0. The result generalizes to all orders. 


Theorem (Nakkagawa and Niégawa). The solution to the T and 
Cj optimization equations is 


Try = rg = = TR =O. (12.70) 


Thus, C = 1 in the optimal scheme, and all perturbative corrections 
are effectively exponentiated and reabsorbed into the anomalous 
dimension Yo. 


Proof. The č optimization equation follows from Eq. (12.43): 


Or; -; 
Sri gt 4 gir BG) 90 _ o, (12.71) 
j J 


where 0C/d@ = `$] ira’. Recall that all terms up to and 
including @* must cancel in any RS, thus determining Or;/ čj. By 
starting the sum at 7 = j + 1 we have already used the fact that 
Or;/OC; must vanish for i < j and for i = j, as noted at the end of 
Sec. 12.4. 

We begin by considering the case j = k. The first term vanishes, 
as there are no terms in the sum, so we find that in the optimal 
scheme 


aC 


au =o (12.72) 
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Next, consider the case 7 = k — 1. In any scheme, cancellation of the 
a* terms requires 


Ərk rı 


OCEK—1 an ko? 


(12.73) 


In the optimal scheme the left-hand side must vanish, since 0C/04a 
vanishes in the optimization equation (12.71). Thus, in the optimal 
scheme, rı = 0. Proceeding to the case j = k — 2 we can find 
Or, /OCK—2 as a sum of ric and r2 terms. In the optimal scheme 
this must vanish, and since we already have rı = 0, we now find that 
T2 = 0, too. We may then proceed to successively lower j cases to see 
that other r;’s vanish. Finally, we reach 7 = 1, where we are dealing 
with the 7 equation, which gives us rg—1 = 0. Substituting back into 
0C/04 = $2 ir,a’—! = 0 then shows that rẹ = 0. 


12.7. The Optimization Equations 


The fact that C = 1 in the optimal scheme allows us to simplify the 
remaining optimization equations, which follow from Eqs. (12.44)— 
(12.46) with the i summations truncated at i = k. 

We first recall that the B;(a) functions are related by 


pie Yai) (12.74) 
to the I;(a) integrals 


a gi? 
i(a)= | di (12.75) 


ee Sec. 7.2. e gj optimization equations involve a related set o 
See Sec. 7.2.) The gj optimizati ti invol lated set of 
integrals 


J;(a) = a d (12.76) 


while the c; optimization equations involve 


Tjs(a) = (i+ 1) [ dx x'I;(a), (12.77) 
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which can be simplified by interchanging the order of the two 
integrations, as follows: 


hala) = (+1) fave! f a y? 
jila) = U+ Tr’ y= 
: 0 0 Bly)? 
[a y fa (4 1e! 
= aay LANG x 
0 Biy)? y 
m UA eii au 
= | dy- (a - : 12.78 
f YEGE | yg) ( ) 
giving us 
Tya(a) = a *T; (a) — Tig 541 (0). (12.79) 


(This corresponds to going back to the form in Eq. (12.35) rather 
than Eq. (12.36); compare with Exercise 12.2.) 
Thus, the 7, cj, and gj optimization equations can be written as 


k k 


Are. 
Sa! 90 gat = 0, “p= 1 (12.80) 
m OT é 
i=1 i=0 


k k 
Ori l 
a +9 olja) =0, j=2.k, (12.81) 
i=j 9 i=0 


> a’ + gJj4i(a)=0. j=1,...,k. vee 


In each of these equations the first term is a polynomial in a 
that must precisely cancel out the terms up to and including a* 
present in the second term, if it were expanded out in a power 
series in dG. Previously, we have used the notation T,[G(a)] = 
Go + Gia + --- + Gna”. Here we will need T,, as the equivalent 
operation in the expansion parameter a. Thus, we may rewrite the 


equations (swapping the order of the two terms and dividing out 
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a g factor) as 


k k 
aX gia — Th pA =0 =P (12.83) 
i=0 i=0 
k 


k 
yatla) - Ty > a) =0, j=2,...,k, (12.84) 
i=0 i=0 


Jj+ı(a) = Tr [J+ (a)] =0. J = 1, sae oy se (12.85) 


However, note that the arguments of the T;,’s are all functions of a, 
rather than ã, so it is best to think of the T;,[G] operation in three 
stages (i) expand G as series in a up to a*, (ii) convert a to à using 
Eq. (12.48): 


a=4(14+Vja+ Vea? +--+), (12.86) 


(iii) re-expand as a series in &, and truncate after the a* term. 

A further simplification results from the realization that, since 
C = 1, we do not need to know the optimized value of a; nor do we 
need to know the ¢;’s or 7: they do not enter into the optimized result 
for F, which just involves evaluating (O) in the optimal scheme. 
Thus, what we need to do is to take combinations of the optimization 
equations in which à and the V;’s cancel out. From the resulting 
equation combinations we can solve for the g; coefficients in terms of 
the “principal variables” a, c2,... cz. (Note that the J and J integrals 
are functions of these principal variables.) Finally, we can use the 
invariants, o; and o1(Q), and the int-8 equation to determine the 
optimized result. Note that when r;=0 the o;’s have exactly the same 
form as the usual p; invariants with g;’s in place of r;’s. 


The optimization equations involve I’s up to [2,41 and J’s up 
to Jx41. However, using relations between the J’s and I’s and the 
complete-sum identities obeyed by the I’s, these integrals can all 
be expressed in terms of I> to Ip. (See Appendix 9.A.) 


It may be possible to find a general solution for the g; coefficients 
in (k + 1)th order, akin to the results of Chapter 9. However, we will 
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content ourselves with illustrating the above observations in the case 
of third order. 


12.8. Third-Order Approximation 


In third order (k = 2) we have four remaining optimization equations, 
in the variables 7, co, g1, and g2. From Eqs. (12.83)—(12.85) these are 


a(1 + gia + goa”) — a — (g1 + Via? =0, (12.87) 
(al — I3) + gı (a° Iz — Ia) + g(a D — Is) 

-1g =0, (12.88) 

J- a (-5 Vi) q =0, (12.89) 

J3 — 48? =0 (12.90) 


Taking the gı equation minus the 7 equation cancels the a terms 
and, not coincidentally, the V; terms, leaving 


Cc a 
Jo —a(1 + gia + goa?) + (5 i a) a =0. (12.91) 


An à? term remains, but we can substitute from the gz equation to 
obtain 


Jo + (e+ 291) J3 — a(1 + gia + goa”) = 0. (12.92) 

Taking the go equation minus the c2 equation cancels the a? terms, 
giving 

J3 — ((al = Iz) + gleh _ I) + g2(a In _ I5)) = 0. (12.93) 


We may solve these last two equations for g1,g2 in terms of the 
principal variables a, c2. 


As mentioned above, we can express all the integrals involved 
at (k + 1)th order in terms of just Iz to I;. In this case, then, all 
the integrals can be expressed in terms of I> alone. The resulting 
expressions are rather cumbersome, though. 
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From the four original equations we have extracted just two 
equations that give us the gj, g2 coefficients that we need. There are 
effectively two other equations that we can just ignore; they would 
determine @ and Vj (which gives 7 and, combined with the int-B 
equation of the tilde scheme, would then fix č2), but we have no need 
to obtain values for these variables. 

To relate the principal variables to Q and the invariants (whose 
values are to be obtained from Feynman-diagram calculations per- 
formed in any convenient scheme), we substitute the optimal-scheme 
quantities into the expressions for 72 and o1(Q), combining the latter 
with the int-$ equation to eliminate 7. In the optimal scheme, since 
ri (and hence s; = r;/g) vanish, the formula for a2 reduces to 


02 = C2 + 92 — gic — 9, (12.94) 


which is the familiar form of a pə invariant, but with g;’s as the 
coefficients. Similarly, in the optimal scheme 


o1(Q) =T- g = K® (a) g, (12.95) 


where KC) (a) is the third-order approximation to the K (a) function 
of Sec. 6.3. (It can also be expressed in terms of Is; see Appendix 
9.A.) 


12.9. A Simpler Approach 


In fact, there is a simpler approach that allows us to get directly to 
the equations determining the optimal g;’s. Consider the physical 
quantity D defined in Eq. (12.53), which we showed is given by 
Eq. (12.57), so that D = yo when C = 1. That suggests that we 
consider F in the form: 


“ D(x) 
F = Aexp dx : 
fo} = B(2) 


where “[0|” is a shorthand for the same “lower limit of 0 with 


(12.96) 


subtraction of the suitable infinite scheme-independent constant,” as 
in Eq. (12.8). Formally, this expression for F is valid quite generally, 
and is independent of the RS used, so it satisfies RG equations saying 
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that the total dependences on 7 and c; all vanish. What we are doing 
in RS/FS optimization is equivalent to a normal RS optimization 
applied to F, except that the approximants being optimized are not 
truncations of the perturbation series for F, but are approximants 
formed by truncating the perturbation series for D and 8. That is, 
the (k + 1)th approximant to F is given by substituting 


A D(z) - Di 2 k 
D(z) = = x Sor} x’, B(x) = —br Ya (12.97) 
i=0 j=0 


into Eq. (12.96). The optimization equations follow from requiring 
the 7 and c; derivatives to vanish. (Note that when we take such 
derivatives the infinite constant plays no role and the “[0]” lower limit 
can safely be replaced by 0, since the resulting integrals converge.) 
For T we have 


1 


10F  ðaD(a) : 
fae, way 


Fôr Or Bla) 


k 
= -o (8% Ti Pa) (12.98) 


while for cj 


b= 10F _ es f an (3 


1 ea ne) 


=- = —| =~+ 
F dc; ðc; pla) ðcj|, L(x) B(x) 
E arP : 
=g | —D(a)lj - 3 Ter out 2 ETT 
(12.99) 
Substituting the series form for D(a) leads to 
k 
o Pia +r Pait! — 0, (12.100) 
k 


2. Jig + D? Iji =0, (12.101) 
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where J; ;(a) = att! I; (a) — Ii4;+41(a) arises from the first and third 
terms of Eq. (12.99). 

The derivatives Or? /O7 and ðr? /ðc; are the usual RS depen- 
dences of perturbative coefficients, and can be quickly found from 
the expressions for the p? invariants. Thus, 
orp =l Ore =c+2r? Ore 


gt y i) pe =k (12.102) 


Using these results, and recalling that in the FS/RS optimal scheme 
the optimized rP’s equal the optimized g;’s, the reader can quickly 
check that at third order (k = 2) Eqs. (12.100) and (12.101) lead 
directly to Eqs. (12.92) and (12.93). 
At fourth order (k = 3) the 7, c2,c3 equations reduce to 
Ja + (c + 291) J3 + (c2 + 2091 + 392) J4 
—a(1+ gia + goa” + g3a?) = 0, (12.103) 


J3 + 2g: J4 — 2,0 + g1 12,1 + g212,2 + 932.3) =0, (12.104) 


1 
z (Is 0 + g113,1 + 9213,2 + 9313.3) = 0. (12.105) 


One can explicitly check that these are indeed the equations one 
would obtain from appropriate combinations of Eqs. (12.83), (12.84), 
(12.85). 


Note that Eq. (12.96) can be written just as 
F = AQP! 


in the notation used in Sec. 2.7. The scale dimension D is a physical 
quantity that depends on a single massive variable Q by the magic 
of dimensional transmutation, as discussed in Chapter 2. As Q > 
oo one finds F x (In Q)~9, while in the infrared limit, if D tends toa 
constant D*, one will have F x Q?” as Q > 0 (see Appendix 12.A). 


12.10. Conclusions 


The optimization approach to the problem of RS/FS dependence is 
less daunting than it appears at first sight. There are 3k scheme 
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variables at (k + 1)th order and k coefficients, r;. However, k of 


the optimization equations lead to rı vee Tk 0, so that 
C = 1, so that another k variables (ř,čə2,...,Čķ) need not be solved 
for. That leaves just k combinations of optimization equations that 
can be solved for g1,...,gķ in terms of the “principal variables” 
a, C2, ...,Ck. In fact, these equations can be obtained more directly by 
the approach in the last section. By substituting in the expressions 
for the invariants, one can then solve for all the needed quantities. 
The last step will require an iterative algorithm, as in ordinary 
optimization (see Chapter 10). 

The RS/FS optimization of F described in this chapter is not the 
same as optimizing D as a perturbative physical quantity and then 
exponentiating the integral of D with respect to dQ/Q. However, 
there are strong indications that the two approaches generally give 
very similar results. At high energies a PWMR-like approximation 
to the RS/FS optimization equations gives the same result as the 
PWMR approximation to D; see Exercise 12.5. In the infrared limit, 
if a fixed-point limit occurs, then again the results are the same 
as for optimizing D; see Appendix 12.A. However, as discussed in 
that appendix, in the pinch-mechanism case there are interesting 
differences. 

There are applications to various quantities, besides structure- 
function moments — for example, heavy quarkonium decays to 
hadrons, B decays to charmonium, or Higgs-boson decay to hadrons. 
Such quantities have a factorized form involving the wavefunction at 
the origin or, in the last case, the quark masses. For applications 
involving parton distribution functions and fragmentation functions 
there is more work to be done. We have only considered the non- 
singlet case; the flavour-singlet case involves matrices describing 
quark-gluon mixing. Also, our analysis has used the language of 
structure-function moments, which is convenient theoretically since 
it reduces a convolution integral to a simple product. However, 
phenomenologically, it seems preferable to deal directly with the 
parton distributions using parton-evolution (DGLAP) equations. It 
would be valuable to see if the moments-based optimization approach 
can be reformulated in that language and put into practice. 
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Appendix 12.A: Infrared Limit for Factorized 
Quantities 


If a fixed-point limit occurs, the generalization of the analysis in Sec. 
11.5 is straightforward. For a near to a* one has B(x) ~ a(a* — a), 
so that I; is given by Eq. (11.10) and hence, from Eq. (12.77), one 
finds 
gtiti-2 
Lix -—li+1) z2 In(a* — a). (12A.1) 


One also has 


*j—2 
tea —In(a* — a). (12.2) 


At third order the optimization equations (12.92), (12.93) are 
potentially dominated by divergent In(a* — a) terms. Requiring these 
to vanish gives 


1+ (c+ 2gj)a* = 0, (12A.3) 
—oa* + (1+ 2g*a* + 3gžša*°) = 0, (12A.4) 


which, when solved for gj, 93, the limiting values of the coefficients 
of D, yield the equivalent of Eq. (11.20) with r’s renamed as q’s. 
Thus, the fixed-point result for F gives F x QP”, where D* is 
exactly the same infrared limit as that obtained by optimizing D 
as a perturbative physical quantity. It may similarly be shown that 
the fixed-point limit of the fourth-order equations (12.103)—(12.105) 
leads to results for gj, 95,93 that are the equivalent of Eq. (11.25). 
It seems likely that this result will hold at any order. 

If a pinch mechanism operates, however, things are more subtle 
and the result is not the same as obtained by applying ordinary 
optimization to D. The J; integral is now divergent proportional to 
1/6°, given by Eq. (11.32), so that, from Eq. (12.79), 


j-2 
i i a TT 1 
Iji (a +1 _ ait) ( = 5) +0 G) (12A.5) 
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The J; integrals also diverge, but only like 1/6: 


gi? a? T 

J; x fea eo (12.6) 
(x — ap)? + ô?) n ô 

The third-order optimization equations, (12.92), (12.93), are domi- 

nated, respectively, by J terms and J terms, giving 


1+ (c+ 29% )ap = 0, (124.7) 
(a* — ap) + gf(a*? — 02) + gh(a*® — a8) =0. (12.8) 


In the latter equation a factor of a* — ap may be discarded since 
the pinch mechanism requires that a* > ap (and we can expect the 
limit a* — ap to correspond to the interface with the fixed-point 
mechanism). Solving for gj, gå yields results distinct from Eq. (11.39). 
As before the conditions B(ap) = B’(ap) = 0 lead to Eq. (11.40) so 
ap = —2/c and ch = c?/4. Hence, 


2 * 
E c P c (6 — ca*) 
== = ss 12A.9 
sı T 4 (4 — 2ca* + c2a*?) ( ) 
Substituting in the definition of the o2 invariant gives 
3 6 — ca* 7 
ME E a 2 (12A.10) 


4 (4 — 2ca* + c2a*?) 16°’ 


which leads to a quadratic equation for a* in terms of the invariants 
c, o2. For the pinch mechanism to operate we must have ap > 0 and 
a* > ap, Which requires 


c<0 and = < = < —. (12A.11) 


The lower limit is the same as in Eq. (11.43), since it is the interface 
with the fixed-point mechanism, but the upper limit is slightly 
smaller. Another difference from the ordinary case is that the result 
for D*: 

a a* (4 — 3ca*) 


D* = —— 12A.12 
(4 — 2ca* + c2a**) ( ) 
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means that it is restricted to the range 
5 yx 
as —cD* < 3. (12A.13) 


(Recall that c here must be negative, and that D = —bgD.) 

At fourth and higher orders we will actually need the subleading 
1/8 terms in the J; and Tj; integrals. In Eq. (11.32), expanding the 
zJ~? numerator about the pinch point ap gives 


j-2 1 x! 
I; x 2 l 1 =o 
J 2 (x! + 6?) ( +O la 
Ba - 12 
CI), han 
2 ap 


where x’ = x — ap and the limits of integration may be taken as —oo 
to oo, effectively, since we only need the terms divergent as ô — 0. 
Hence, we find 


j-2 è Ş 
a% fl G-2)\G-3)1 
I~ P (5 + 2a? 5} + O(1). (12A.15) 


Using this result and Eq. (12.79) we can then obtain, for a combina- 
tion of J;,;’s in which the leading terms cancel, that 


T4118 — [jiap © pate 3((j — 2)a**t* — (i+ j — Lait). 
(124.16) 


At fourth order, the first optimization equation, (12.103), is 
dominated by J terms and gives 


1+ (c+ 2g7)ap + (c + 2cgi + 395 )ax =0, (12A.17) 


while the other two are each dominated by 1/8? terms whose 
cancellation requires 


(a* — ap) + g(a"? — a2) + g3 (a? — a3) + g3 (a*t — a8) = 0. 
(124.18) 
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If we take Eq. (12.105) minus ap times Eq. (12.104) the leading 
divergences cancel, leaving 1/6 terms whose cancellation requires 


nas(1 + 4gtap) — (1+ 2gfap + 89305 + 4g3a3)=0.  (12A.19) 


From the pinch-point conditions B(a,) = B'(ap) = 0 and the 7 
definition as 7 = 4B" (ap) we have 


343 2 3 
T Aid gatan e R u, (124.20) 
ap ap ap 


From these equations and the o2,03 definitions one may proceed to 
solve for the gj, 95, 93 coefficients in terms of the invariants and ap, 
and find a sixth-order polynomial that determines ap. Finally, the 
infrared limit D* can be found. 

The infrared behaviour of F is then F x Q?”, where the exponent 
is D(a) with g; = gž and a = a*. However, returning to Eq. (12.96) 
involving f dz D(«x)/B(x) and taking 6(x) to have the pinch-point 
form one would find that the exponent is given by D*(a) at ap, 
rather than at a*. All is well, however, because the optimization 
equation (12A.18), or Eq. (12A.8) in the third-order case, is precisely 
the condition that D*(a,) and D*(a*) are the same. 


Exercise 12.1. Give an alternative proof of the theorem that the 
normalization constant A in Eq. (12.8) is RS invariant by suitably 
adapting Osborn’s proof of the Celmaster—Gonsalves relation (see 
Sec. 6.5). 


Exercise 12.2. Show the equivalence of Eqs. (12.36) and (12.35) 

for -> in two ways: 

(i) Integrate by parts in Eq. (12.35) and use the differential 
equation satisfied by the 6; functions, Eq. (7.12). 

(ii) In the right-hand side of Eq. (12.36), express 6; (a) in terms of 
the integral I;(x) (see Eq. (12.75) and Sec. 7.2) to get 


5 2- -; f œen 


Write J; (x) as an integral over another dummy variable y and 
then perform the x, y integrations in the other order. 
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Exercise 12.3. Show that the 7 RG equation (12.44) can be seen 
as the j — 1 limit of the c; RG equation (12.45) in the same sense, 


cy” > c— (j —1)r, as in Exercise 7.3. Show that the same goes 
for the corresponding 7 and č; equations with “č” > c— (j—1)r. 


Exercise 12.4. Show that F can be expanded out as a perturba- 
tion series 


F = Alcal9(1+rfatrea? +---) 


and find the first few coefficients. Calculate the invariants for F 
(using the form for a physical quantity with the power P = g). 
Show that these are related to the o;’s, which are the p? invariants 
associated with D = (Q/F)dF/dQ in just the way expected from 
the results of Exercise 7.6. 


Exercise 12.5. Consider the equivalent of the PWMR approxima- 
tion for the optimized g; coefficients in the factorized case. Show 
that the results, for k = 1,2,3 are the same as for the r; coefficients 
of the ordinary case; see Eq. (9.25). (It seems likely that this result 
will hold for all k.) 


Chapter 13 


Exploring All-Orders OPT in the 
Small-b (BZ) Limit 


13.1. Introduction 


The constant b is (33—2n,)/6 in QCD, which means that asymptotic 
freedom is lost once np exceeds 164. The constant c, although positive 
at low ny, changes sign at nf = 87. Thus, as we saw in Chapter 11, 
when nf approaches 164 from below one finds fixed-point behaviour, 
with the infrared couplant a* proportional to 164—n f. That property 
was noted by Caswell, and later Banks and Zaks proposed making an 
expansion about ny = 164. (‘The idea generalizes to an SU(N) theory, 
and to other gauge theories; in some of those cases the critical ny 
can be an integer value.) 

The BZ expansion is a distinctly unusual sort of expansion. It 
is not like a typical perturbative method, where a theory with some 
parameter A is exactly soluble at Aà = 0 and one can step away 
from A = 0 in a Taylor-series fashion. Here the theory with ns = 
164 is not soluble; it would be a non-asymptotically-free “delicate” 
theory in which the 8 function starts at order a? (see Exercises 6.1 
and 7.7). That non-trivial special theory is not actually relevant, 
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Bla) 


Fig. 13.1. If the couplant at some finite energy scale lies in the region 0 to a*, 
then it is trapped in that region at all energies. In the BZ limit a* tends to zero. 
(Note, however, that if a > a* then we would be, in the limit, in the non-trivial 
“delicate” theory.) 


either. One must begin, not at nf = 164, but infinitesimally below 
that value. Also, one must assume that the couplant, at some finite 
energy scale, lies in the range 0 to a* ~ —1/c. It is then trapped in 
that infinitesimal region at all energy scales. See Fig. 13.1. 

The BZ expansion is normally discussed — as we shall do 
in the next section — only within a restricted class of “regular” 
renormalization schemes, where perturbative coefficients have a 
simple, polynomial dependence on ny. However, infinitely many 
schemes — and in some sense most schemes — are not “regular.” 
In particular, the “optimal” scheme is not. In “regular” schemes 
one needs only k terms of the perturbation series to obtain k terms 
of the BZ expansion, but in other schemes the information needed 
is distributed among higher-order terms. In general all orders are 
required. Turning that observation around, the BZ expansion can be 
viewed as a “playground” in which one can analytically investigate 
arbitrarily high orders of OPT in QCD. Admittedly, this adopts the 
“drunk-under-the-lamppost” principle of looking, not where we really 
want to, but where there is enough light to make a search. The deep 
and difficult issues that we would like to study — “renormalons” and 
factorially growing coefficients — are simply absent in the BZ limit. 
Nevertheless, this exploration provides some interesting insights and 
employs some methods that may have wider applicability. 

In this chapter, we only consider quantities R with P = 1. 
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13.2. BZ Expansion in “Regular” Schemes 


For nf just below 1634 the 6 function has a zero at a very small a*, 
proportional to (164 — nf). Its limiting form, 


ie kar = asi de aes (13.1) 
serves as the expansion parameter for the Banks—Zaks (BZ) expan- 
sion. To proceed, one first rewrites all perturbative coefficients, 
eliminating np in favour of ag. The first two 6-function coefficients, 


which are RS invariant, become: 


107 
b = —ao, (13.2) 
8 
1 19 
Se i3 
a + i (13.3) 


Note that c is large and negative in the BZ context. 
We will consider a class of “primary” physical quantities for which 
the p; invariants have the form 


1 2 
— mo (pi,-1 + Pi,oao + 1,195 +--). (13.4) 


Pi 
This class includes Re+e-, if we ignore the X` q; terms. Note, however, 
that not all physical quantities have p;’s of this form; the scale 
dimension D of R,4,-, for instance, would not (see Exercise 7.6). 
Within the class of so-called “regular” schemes, the }-function 
coefficients, bc;, are polynomial in nz, and hence polynomial in ao, 
so that 
1 
GC = a (Chit + Cj,9a9 + ci aag aeai -). (13.5) 
0 
Note that this equation is a property of the scheme, irrespective of 
the physical quantity, whereas Eq. (13.4) is a property of the physical 
quantity, irrespective of the scheme. For “primary” quantities in 
“regular” schemes we have 


Ti = Tio + ride + riag +. (13.6) 


In fact, for certain quantities the numerator of Eq. (13.4) is a 
polynomial whose highest term is p;,;@9’t!, and in certain “rigid” 
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schemes, such as MS, the ao series for c; and r; truncate after the 
Cii—1 and r;; terms. These properties are unimportant here, but 
are crucial in the opposite limit, the large-b approximation. 


Expanding in powers of ao the zero of the 8 function is found 
to be 


a* = ao(1 + (c2,-1 + c1,0)ao +++: ), (13:7) 
and hence the infrared limit of R is 
R= ag(1 + (r1,0 + =i + c1,0)ao Paer ). (13.8) 


Since the BZ expansion parameter ao is RS invariant, the coefficients 
in the R* series are RS invariant and can be written in terms of the 


Pij: 
R* = ao(1 + (p2,-1 + p1,0)a0 +++: ). (13.9) 


Note, though, that a* is not a physical quantity and its aọ expansion 
has RS-dependent coefficients. 

At a finite energy Q the result for R to nth order of the BZ 
expansion can be expressed as the solution an equation of the form 


1 1 R 
for n = 1,2,3. (For n > 4 there are additional terms; see Appendix A 
for details.) Here R*(™ and 4*0) are the nth-order approximations 
to R* and 7* = É. As discussed in Chapter 11, the critical exponent 
y* governs the manner in which R approaches R* in the Q > 0 limit: 


(R*-R) x Q”. (13.11) 


In the present context it is safe to identify y* with the slope of the 
6 function at the fixed point and so its BZ expansion is 


jt = T = ao(1 + giao + 9205 + O(a5)), (13.12) 


where the g;’s are the universal invariants of Grunberg: 


gı = ĉ1,0 = p1,0; 
; P 5 5 (13.13) 
92 = C19 — €2,-1 — ©3,-1 = P1,0 — P2,-1 — P3,-1- 


Exploring All-Orders OPT in the Small-b (BZ) Limit 251 


x RQ) 
R 
0.78 F 
—— ot 
Ag Q 


Fig. 13.2. Schematic picture of R as a function of Q close to the BZ limit showing 
the three regions (i) the “spike” at very low energies, (ii) the huge flat region where 
the theory is “nearly scale invariant,” and (iii) the slow approach to asymptotic 
freedom at very high energies. (Region (iii) is shown on a log scale.) 


They are universal in that they do not depend on the specific physical 
quantity R being considered, and invariant because they can be 
expressed as combinations of the invariants p;,; (combinations in 
which all the r;,; terms cancel). 

Close to the BZ limit R remains almost constant over a huge 
range of Q about Arg. This constant value is not R* but 0.78R*. 
More precisely, it is R*/(1 + x) where ny + x +1 = 0, a result 
that follows from Eq. (13.10) to leading order in ap with p,(Q) = 0, 
corresponding to Q = Ar. Only when Q/Ar becomes extremely 
small does R have a “spike” that rises up to R*, and only when 
Oy Ar becomes extremely large does R very slowly decrease to zero, 
as required by asymptotic freedom: See Fig. 13.2. The plateau region, 
where R stays within 10% of the value 0.78R*, is roughly for Q/Ar 
in the range from exp(—0.04/a?) to exp(+0.04/a?). 

Since Eq. (13.10) completely characterizes the Q dependence of 
R in low-orders of the BZ expansion, it suffices to consider R* and 
4*, both of which are quantities defined in the Q > 0 limit. 


13.3. Low Orders of OPT in the BZ Limit 


The procedure for obtaining the (k + 1)th-order OPT result for the 
fixed-point limit R* described in Sec. 11.5 requires, of course, the 
numerical values of the invariants up to pz. The great simplification 
in the BZ limit is that we can effectively set almost all the invariants 
to zero: this can be seen as follows. As ag —> 0 the most singular 
term in any of the p; is of order 1/ag, but each p; enters the analysis 
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along with a factor of a*' that is of order aĵ. Thus, to find the leading 
term in the BZ limit, we can effectively set to zero all the invariants 
except c. (Furthermore, only the —1/ao piece of c will contribute.) 
To obtain the next-to-leading correction in aj we would also need 
the 19 piece of c along with the p2,—1/ao piece of p2 (whose value 
depends on the specific R quantity under consideration). 

For k = 2 the OPT fixed-point results are given by Eqs. (11.20)— 
(11.22). In the BZ limit, we can set p2 = 0 so that the a* equation 


becomes 
(ca* + 1)(ca* — Z) =); (13.14) 


Hence, we find a* = —1/c — ag. The coefficients c3, rj, r3 all vanish, 
so, in an a posteriori sense, the k = 2 OPT scheme is “regular” in 
the fixed-point limit. The final result for R* is 


1 
R* = —- > ap. (13.15) 
Cc 


Thus, exactly as in any “regular” scheme, both a* and R* tend to 
ag in the BZ limit. The same is true for 7*, obtained from the slope 
of the optimized ( function at the fixed point. 

At higher orders, though, the OPT scheme is not “regular” — the 
optimized rž, coefficients, for instance, have 1/aj’ pieces — and the 
story is more complicated. For k = 3 we have Eqs. (11.25)—(11.29). 
The a* equation in the BZ limit (where p2 = p3 = 0) reduces to 


83 + 52ca* = 0. (13.16) 


Thus, we do not get a* = —t —> ag, but a* > Sao = 1.596ao. 


The final result for R* is not ag but is BSS? ag = 1.035a9, which is 
remarkably close. 

Results for higher orders are shown in Tables 13.1 and 13.2. The 
even-k results are significantly better than those for odd k. Note 
that a*/ag increases, apparently towards 4. It is perfectly acceptable 
for a* to differ from ag, since a* is inherently scheme dependent. 
However, R* is a physical quantity so it is reassuring that R*/ag is 
always close to 1. In Sec. 13.5, we will find a simple explanation for 
a* /ay > 4 and R*/ao > 1 as k > co. 
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Table 13.1. OPT results in the 
BZ limit for k = even. 


2 1 1 1 

4 1.85035 1.00370 0.9841 
6 2.30294 1.00214 0.9742 
8 2.58980 1.00137 0.9671 
10 2.78928 1.00096 0.9614 
12 2.93666 1.00071 0.9565 
14 3.05030 1.00055 0.9523 
16 3.14081 1.00043 0.9485 
18 3.21470 1.00035 0.9451 


Table 13.2. OPT results in the 
BZ limit for k = odd. 


k at R* 2 
ao ago ao 


3 1.59615 1.03501 0.5602 
5 2.17343 1.01119 0.5886 
7 2.51313 1.00544 0.6071 
g 2.73950 1.00319 0.6206 
11 2.90228 1.00209 0.6311 
13 3.02550 1.00147 0.6397 
15 3.12231 1.00108 0.6468 
17 3.20056 1.00083 0.6530 
19 3.26522 1.00066 0.6583 


The situation with ĝ* is less clear. This is also a physical quantity 
(with the caveats of Appendix 11.8) so we should have 4*/ap > 1 
as k — oo. The numerical results in the tables cannot be said to 
support that contention, but neither are they inconsistent with it; 
one can make good fits to the data with functions of k that very 
slowly approach 1 as k = co for both even and odd k. 

It is hard to go to much larger k with the method described in 
this section, so we turn to an analytic approach in the next sections. 
Our results — albeit in approximations to OPT rather than true 
OPT — support the claim that a*/ap — 4 and that both R*/ap and 
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4*/ao tend to 1 as k — oo: they also provide valuable insight into 
the workings of OPT at arbitrarily high orders. 


13.4. Analytic Tools for OPT at All Orders 


To make progress analytically with OPT in (k + 1)th order it helps 
greatly to deal with functions and differential equations rather than 
with 2k individual r; and c; coefficients. The set of p; invariants — see 
the discussion following Eq. (7.27) — naturally follow from a single 
“master equation,” since they are obtained by equating coefficients in 


Buo(R) = ss B(a), (13.17) 


which we shall refer to as the “invariants master equation.” 

What we need is to formulate the k optimization conditions 
also as a “master equation.” For general Q this would be a more 
daunting task, but in the fixed-point limit it is relatively simple — 
and, happily, that suffices since, as noted earlier, in the BZ limit 
(and for the first three terms of the BZ expansion), the entire Q 
dependence of R is characterized by the two infrared quantities R* 
and 4*. 

We now show that the optimization conditions in the fixed-point 
limit, Eq. (11.16), follow from equating coefficients in the following 
“fixed-point OPT master equation:” 


OR _ a-l OB(a) B(a) 
=B() - q (2 a a 2) (13.18) 


(Superscripts “(k+1)” on R and B(a) are omitted for brevity.) Note 
that a here is merely a dummy variable, while a* is the optimized 
couplant in the infrared limit. 

The first step of the proof is to note that, by the definition of a*, 
the polynomial B(a) has a factor of a* — a and can be written as 


leo) yx 2) 
B(a) > ( ) Ên, (13.19) 
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where tf, is a partial sum of 6-function terms: 
n 
i= SG (13.20) 
j=0 


with ĉj = cja*), Note that ĉn — tn_1 = ĉn and that ¢;, = 0 by virtue 
of the fixed-point condition. To show Eq. (13.19), expand the right- 
hand side, then use f = 0 and define t_; = 0 to get 


k a\n. k—1 a\ntl , 
(=) p= Y (=) Ên. (13.21) 
n=0 n=—1 


Now put n = n’ — 1 in the second sum and recombine the sums to 
get 
k pe k poa k 
D-i) = oem (Ge) e, 8.22) 
n= n=0 n=0 


which is B(a), as claimed. 
To prove Eq. (13.18), equate powers of (a/a*)™, using Eq. (13.19) 
to write B(a)/(a* — a) as a polynomial. This leads to 


1 A 
m — Cm — 2 om m—1)- 13.2 
8 é ao + tm—1) (13.23) 


Using tm —tm—1 = ém again and simplifying leads to the fixed-point 
optimization conditions, Eq. (11.16), completing the proof. 

Unfortunately, Eq. (13.18) proves difficult to deal with. To make 
progress we have resorted to two approximations, designated PWMR 
and NLS. The former was discussed in Sec. 9.3 and corresponds to 
dropping the O(a) terms in Eq. (9.25), giving 


k—2 
Sm = — Le (13.24) 


which is easily formulated as a “master equation”: 


OR 2 ƏB(a) 
a, a 


(PWMR). (13.25) 
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Looking at the above equation, or the original equation (13.18), 
it is tempting to suppose that, as k — oo, they reduce to 


ae = Bia) (NLS). (13.26) 
Oa 

We shall refer to this as the “naive limiting scheme” (NLS). It 
corresponds to a well-defined RS in which Sm = Cm, so that the 
coefficients rm = Sm/(m + 1) of the R series decrease by a factor 
1/(m + 1) relative to the coefficients of the B series. 

Clearly, this idea is very naive. In the PWMR case, the actual 
relation only reduces to Sm % Cm for m < k; that is, for the early 
part of the series only. Nevertheless, there may be a kernel of truth 
here, for if the series are “well behaved” the early terms should 
dominate. In any case, adopting this naive idea leads us in a fruitful 
direction. Our investigations below will lead us to conclude that, at 
least in the BZ context, the NLS does yield the all-orders limit of 
OPT, although it is a poor guide to how fast results converge to that 
limit. 

Using the NLS equation above to eliminate B(a) in the invariants 
master equation (13.17) leads directly to 


a? (OR\? 
Taking the square root leads to 
OR R 
= 13.2 
ða A Bro(R), ( 3 8) 


which is immediately integrable. (Recall that the partial derivative 
notation merely means that the coefficients of R are regarded as 
constant. For our purposes here it can be replaced by an ordinary 
derivative without creating confusion.) 

The BZ limit provides us with a nice “playground” for exploring 
further, since it effectively corresponds to the case Beo(R) = 1+cR. 
We continue this analysis in the next section. 
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13.5. All-Orders NLS in the BZ Limit 


In the BZ limit the only one of the p, invariants that contributes 
is c, which is negative: c = —1/ag + O(1) as a9 — 0. We may 
set Buo(R) =1+cR in this limit. (The terms neglected can only 
contribute to O(ag) corrections, as argued in Sec. 13.3.) It is 
convenient to define 


u= ae v= —cR. (13.29) 
In these variables, the NLS condition is B = 72 and Eq. (13.28) 


becomes 
Ee er (13.30) 
u 


which leads to 


[=-= -|Z (13.31) 


Performing the integral and then exponentiating both sides gives 


1-Jl-v _ 

lpas ” 
where the constant of integration has been fixed by requiring v > 4u 
as u — 0, corresponding to the R series beginning R = a(1+---). 
Inverting this equation (assuming u < 1) gives 


(13.32) 


4u 
Hence, B = 72 is given by 
l= 


(The two formulas above are key results. They show an interesting 
u — 1/u duality that we shall discuss in Sec. 13.8.) 

The fixed point, where B = 0, is at u* = 1. Recalling Eq. (13.29), 
we see that a* is —4/c — 4ag. Nevertheless, because u* = 1 in 
Eq. (13.33) leads to v* = 1, we find R* = —1/c > ao, in agreement 
with the regular-scheme result. 
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Evaluating the slope of the 8 function at the fixed point gives 


4 d 1l-u —b bag 
—b z —— =, 13.35 
(a) ea r) a > 0899) 
which seemingly gives 4* = y*/b = 540. Here the subtlety discussed 
in Appendix 11.8 comes into play, since 7* is really the infrared limit 


of the “effective exponent” 


dB ËR /dR 


ag Oe as 


(Q) = (13.36) 


Normally the second term drops out in the infrared limit because 6(a) 
vanishes at the fixed point. However, in the NLS the denominator 
gR also vanishes because it is B(a) = B(a)/(—ba?). Therefore, in 
the NLS case the second term contributes —ba? R R —ba? B which 
contributes an equally with the first term, thus DAT the previous 
result by a factor of 2. Hence, we find 7* = apo, in accord with the 
regular-scheme result. 

The preceding discussion corresponds to the NLS result 
resummed to infinite order. One must now ask: Do the finite-order 
NLS results converge to their infinite-order form — and, if so, how 
fast? At (k + 1)th order the B and v series are truncated, and v* is 
found by evaluating at u*, the zero of the truncated B. Luckily, as 
with a simple geometric series, the sum of finite number of terms can 
be expressed fairly simply. The truncated B series is 


k+1 


k 
. l-u u 
Bet) k 4,2 
1)? — —1)"k 
= G+ Tene Geen) 


x (1 +0 ns (13.37) 


Only for odd k do we get a zero. (We will discuss even k near the 
end of this section.) The zero of the truncated B is just before u 
reaches 1. If we put 


(13.38) 
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with n(k) < k, we find (noting that uk+! > e—")) that 


n(k) = 31n k — In(Ink) — In(3/4) + O S) (13.39) 


The truncated v series is 


a wa í mg “i 


(13.40) 


When we substitute u = u* we find a cancellation of the ņn(k)/k terms 
which leaves 
"E 91n? k 
walaa (13.41) 
This is in good accord with the numerical results in Table 13.3. 
A similar analysis for 47* (including the factor of 2 discussed 
above) leads to 


4* = ap(1+3(-1)**1? lnk +---), (13.42) 


which indicates that the NLS results for 7* do not converge — the 
nominal limit of ag is “corrected” by a Ink term arising from the 
series-truncation effects. We indeed see this in the numerical results 
in Table 13.3. 


Table 13.3. NLS results in the BZ 


limit. 

k 4ut* = ya RE a 
ag ao ao 

3 1.41825 0.69455 3.67 


11 2.26825 0.90345 7.14 
19 2.65953 0.95010 8.79 
51 3.25059 0.98737 11.70 
101 3.53265 0.99555 13.66 
601 3.88410 0.99976 18.71 
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Returning to Eq. (13.37) we see that the truncated B(u) function 
1 


closely approximates its limiting form TH until u gets close to 1. 
For odd k the (—1)* “truncation effect” term causes B to suddenly 
dive down, producing a zero. For even k this term causes B to 
suddenly shoot upwards and there is no zero. This means that there 
is no finite infrared limit in these orders; the “spike” in R goes all 
the way up to infinity. However, since B has a minimum very close to 
zero the running of the couplant “almost stops” here and if we were 
to evaluate v at this value of u we would find a result close to the 
R* /ao obtained in the previous odd-k order. A related observation 
is that, with only a slight change of RS, we would find an infrared 
limit arising from a pinch mechanism (see Appendix B). 

We conclude that the NLS provides a lot of insight into OPT as 
k — œ, but is only a rather crude approximation to true OPT. We 
move on to the PWMR approximation in the next section. 


13.6. All-Orders PWMR in the BZ Limit 
As before we have Buo(R) = 1+ cR in the BZ limit and we use 


u = = and v = —cR. In these variables, the invariants master 


equation (13.17) becomes 


v2 (L=) 
du 


and the PWMR master equation (13.25) becomes 


o =j 2, (13.44) 
We will proceed to solve these two coupled differential equations, 
treating k as an ordinary parameter: only later we will consider the 
other k dependence coming from the truncations of the resulting 
series at (k + 1)th order. (One can explicitly check that at low k 
this two-step approach does produce the same results as a PWMR 
version of the OPT procedure described in Sec. 13.3.) 
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We begin by making an ansatz: 


_iIdvil 


= ——__ 13.4 


where € depends on u. (We will actually want to view it as a function 
of a new variable X, introduced below, that itself is a function of u.) 
Substituting in Eq. (13.43) leads, in the same way as in the NLS 
case, to 


dv du 
ey bec 13.46 
ls —v f u é ( ) 
which leads to 


4X 


with the new variable X defined by 


X= exp | Ze, (13.48) 


or more specifically, enforcing X —> u as u > 0, 


X= uexp | a (€-—1). (13.49) 
0 u 
Note that 
dX X 
— = — 13.50 


so that the inverse relationship is 


u=xew f & (w) (13.51) 


We will now want to consider € as a function of the new variable X. 
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We can now find W a as fe gx and substitute back in the ansatz 
(13.45) to get 
(1—-X) X 


B= Wax a (13.52) 


From this we can calculate 4 gB , which, after some algebra, reduces to 


dB B o 


1 ea x), (13.53) 


du u dX 


Substituting this, and 1% = €°B from the ansatz (13.45), into 
Eq. (13.44), leads, after cancelling a factor of B, to an equation for 


E(X): 


1- ¿£ = 


2 (CE 
( dX 


dg 
= = p-1-x5), (13.54) 


Remarkably, this nonlinear, first-order differential equation is soluble. 


The trick is to write € in the form 
2X dF 
=l1-—-~—_.. 13. 
Š k F dX (135) 


This substitution, because of a cancellation of (F'/F)? terms, leads 
to a linear second-order equation for F. A further substitution, 


F=(1-X}’F, (13.56) 


leads to a Gauss hypergeometric equation, revealing that 
3 1 
F= (-n = z) (13.57) 


where n = k/2 — 1. We will focus on the case of even k. (Curiously, 
the roles of odd and even k are reversed relative to the NLS case.) 
For even k, the F function is a polynomial of degree n in X?: 


eee (2(n — i) + 1)! Oo; 
XT. 13.58 
-gmh a (n— i)! Pa ( ) 


The first few F’s are shown in Table 13.4. Note the “reflexive” 
symmetry 7 > n — i, meaning that the coefficients are symmetric 
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Table 13.4. The first few F polynomials 
and their form for large k = 2n + 2. 


k n F 

2 0 1 

4 1 1+% 

6 2 1+X+xX* 

8 3 14+2Xx?+2xX44 X’ 

10 4 14+ 4X74 2x44 4X° +X" 


co œ «6(1—X?)-3/2, (x £1) 


Vna Ee) (X=) 


n 


about the middle. In the n — oo limit F approaches (1 — X?)~3/?, 
except near X = 1, where its behaviour involves a modified Bessel 
function I; (see Table 13.4). 
To find u in terms of X it is helpful to use another representation 
of €, namely 
1 1 X dP 
-= = l- — 13.59 
E n+2P dx’ ( ) 
so that Eq. (13.51) will immediately lead to 
u= XP T, (13.60) 


Substituting the above form for z into the equation (13.54) leads 
again to a linear equation. One can verify that this equation is 


satisfied by setting 


P= (1+ X)P (13.61) 
with 
1 1 dF 
P= Ea EE (In+1- (n —1)X) F-2(1 xx), 


(13.62) 
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Table 13.5. The first few P polynomials and their form for large 


k = 2n +2. 

k n P 

2 0 1 

4 1 1-X+xX? 

6 2 1-4X+4+2x?- 4x4 x4 

8 3 1-3X+2X?- Bx -xt 3x? +X’ 

10 4 1-8X4+2X?- 8x94 2x*- 8x94 BX 3x74 X® 


œo œ (1=-X) Pee? si) 
vn ehs) (X=1-2) 


The numerator turns out to have a (1 + X) factor, so that P is 
a polynomial of degree 2n in X. The first few P’s are shown in 
Table 13.5. These polynomials also have a “reflexive” property. 

Yet another expression for € is 


(1+X)P 


= > 13.63 
cP (13.63) 
which can be proved by substituting for P and simplifying to reach 


Eq. (13.55). Using this form of € in Eq. (13.52) gives 
B =(1- XPFP (rs), (13.64) 


As noted in the tables, both F and P polynomials have simple 
limits as k — oo, provided that X 4 1. It is easy to see that X > u 
and that all formulas revert to their NLS forms in this limit. Thus, it 
is clear that v* must ultimately tend to 1, so that R* = ag in accord 
with the BZ limit. 

However, to go further analytically and determine how fast the 
finite-order PWMR results approach their infinite-order form is beset 
with difficulties; the subtleties when X ~ 1 are crucial. The theory 
of hypergeometric functions when two parameters go to infinity is 
formidably complicated. Moreover, in any finite order we need to 
re-express both B and v as series, not in X but in u; then find u* from 
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the zero of the truncated B series; and then evaluate the truncated v 
series at u = u*. Nevertheless, we can explore these issues numerically 
with Mathematica. We have been able to explore up to k ~ 100 and 
the numerical results are presented in Table 13.6. It appears that v* 
approaches 1 significantly faster than in the NLS case: 


Ink/ko 


a (13.65) 


v~l- A 
with A ~ 0.08 and ko % 2.5, roughly. 
The ratio of v to its NLS form vys = TF stays very close to 1 
in the entire relevant range 0 < u < u*, although it strongly deviates 
thereafter. See Fig. 13.3 


Table 13.6. PWMR results in the 


BZ limit. 

k 4y*- 2 yak a 
ao ao ao 

2 1 1 1 


4 1.56878 0.99743 1.0526 
10 2.41100 0.99893 1.1064 
18 2.88641 0.99952 1.1371 
50 3.46514 0.99990 1.1869 
100 3.69257 0.99997 1.2183 


Fig. 13.3. Plot of v divided by units = as as a function of u for PWMR at 
k = 100. The curve is shown dashed beyond u = u* = 0.92314. 
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The v series is also much better behaved than in NLS, where 
the magnitude of the coefficients increased in arithmetic progression: 
unis = 4u $ ;(j +1)(—u)’. In the PWMR case, the coefficients vj in 


v = 4u `> v;(—u)? (13.66) 


are plotted in Fig. 13.4 for k = 100. The initial (j + 1) growth 
is suppressed by a more-than-exponential decay (a crude fit is 
(j + 1) exp(—0.0197/2)). The middle coefficient j = £ is exactly 
zero because of the k — 2j factor in the PWMR relation between s; 
and cj coefficients, Eq. (13.24). The coefficients remain very small 
thereafter. The somewhat bad behaviour of the last few coefficients 
is almost entirely suppressed by the uf factor, even at u = u*, the 
largest relevant u, and it actually plays a beneficial role. This can 
be seen in Fig. 13.5 which plots the partial sums of Nmax terms 
of the v series, Eq. (13.66), at u = u* in the case k = 100. The 
series has pretty well converged after 50 terms, but including 25 more 
terms significantly reduces the error. The very last term makes an 


ett 
50 60 70 80 390 100 


Fig. 13.4. Coefficients vj in the series expansion of v(u) = 4u poe v;(—u)’, for 
PWMR with k = 100. The inset shows the higher-order coefficients on a finer 
scale. 


Exploring All-Orders OPT in the Small-b (BZ) Limit 267 


(a) T T tr T T T 


(c) 1.00000 
0.99999 
0.99998 
0.99997 
0.99996 
0.99995 
0.99994 
0.99993 
90 92 94 26 98 100 
Nmax 


Fig. 13.5. The partial sums 4u 726” v;(—u*)? versus nmax for the v* series in 
the case k = 100. The plots use three different scales, so as to show that (a) 
the series has crudely converged after 50 terms but (b) a slight adjustment from 
50 to 75 terms reduces the error quite significantly, and (c) the last term makes 
an unexpectedly large change, given the trend of the preceding terms, but this 
further improves the result and means that the last term is, within a factor of 2, 
a good measure of the actual error. 
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unexpectedly large correction, but this further reduces the error and 
means that the last term provides quite a realistic error estimate. 

The series for 4*, which is just dG/da|*, is much worse behaved. 
Also the sequence of results for 7* in Table 13.6 appear to diverge, 
though at a much slower rate than in NLS. It is reasonable to hope 
that the extra subtleties in full OPT would lead to 4* converging to 
ag, albeit very, very slowly, in view of the low-order OPT results in 
Tables 13.1 and 13.2. 

We have not been able to extend the analysis to the full fixed- 
point master equation, (13.18). One can get to an equation similar to 
Eq. (13.54), but with an extra term involving u/(u — u*) that seems 
intractable. Moreover, the parameter u* can only be fixed after the 
B(u) function is found, and expressed as a truncated series, so the 
interaction between analytic subtleties and truncation effects is even 
more complicated and delicate. 


13.7. BZ Expansion in All-Orders OPT 


Setting aside the difficult issue of how fast results converge as k + ov, 
the results of the last section confirm that the simple NLS formulas 
from Sec. 13.5, 


4u 
l-u 


represent the all-orders limit of PWMR — and presumably of true 
OPT too — in the BZ limit. As previously noted, these formulas give 
the same BZ limit for R* and 4* as “regular” schemes. We now show 
that higher terms in the BZ expansion are reproduced correctly by 
all-orders NLS. 

Before discussing the general proof it is instructive to look at 
next-to-leading order in the BZ expansion. At this level, we now 
need two of the invariants, c and p2 so we take 


Bgc =1+eR+ pR?. (13.69) 
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(In fact, only the p21 piece of p2 would contribute when we 
re-expand the results in powers of ap. However, it will not be 
necessary to carry out that step explicitly, since once we show 
equivalence to the EC scheme, a “regular” scheme, we are bound 
to get the same BZ expansion to the corresponding order in ag.) 
Recall that the NLS condition and the invariants master equation 
together lead to Eq. (13.28), 


Bac (R), (13.70) 


which now gives 


i -fZ (13.71) 
RVJ1 +R + pyR2 a 


Integration yields 


In ee = Ina, (13.72) 
2+¢R+2\/1+cR+4 pR? 


where the constant of integration has been fixed so that R — a as 
a — 0. One can now exponentiate and solve for R, and then B(a) can 
be found from dR/da. As before we define u = —ca/4 and v = —cR. 
The zero of B is at 


1 
u* = — (13.73) 


E 


and in terms of these variables we find 


4 
v= — (13.74) 
(1+ 2u + 4) 
1-4 
B = ——— >. (13.75) 


(1+ 2u + %)2 


It is now straightforward to check that v evaluated at u = u* gives 


x 4 
R*=- =- (1- 1), (13.76) 
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which is the root of Bgco(R) = 0. Thus, the R* of all-orders NLS 
agrees with the R* of the EC scheme. Also, 4*, defined as the infrared 
limit of Eq. (13.36), which leads to 


* 


yaa , (13.77) 


= da 


with the factor-of-2 subtlety as in Sec. 13.5, can be shown to reduce 
to 


* 


2 dBre (13.78) 


oe _pe Buc 
7 dR 


which is the 4* of the EC scheme. 

The general proof is really just a special case of the general formal 
arguments that R* and 4* (properly defined) are invariant under RS 
transformations. From Eq. (13.70) we can see immediately that B(a), 
equal to dR /da in NLS, must vanish when Bgc vanishes; thus the R 
evaluated at a = a* in NLS must agree with the R* defined as the 
zero of the EC £8 function. Furthermore, the equivalence of the two 
equations for 7* above can be proved just from the NLS condition 
B = dR/da and Eq. (13.70), without assuming any specific form for 
Buc. 


13.8. a — a**/a Duality 


It is easily verified that under u > u*?/u the v of Eq. (13.74) remains 
invariant, while the B of Eq. (13.75) transforms to —(u?/u*?)B. 
These properties are even easier to spot in Eqs. (13.67), (13.68), 
in the BZ-limit case, where u* = 1. 
Let us try to trace the origin of these properties. Consider a 
transformation 
y2 
a — —, (13.79) 
a 
with some positive constant A. We postulate that R and all the 
pi invariants remain invariant and that the (-function equation, 
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use = (a) maintains its form. The latter condition means that 
d 
= = —a2B(a), (13.80) 


where 7 = bln(u/A), must transform to 


E au 
where BT (a) = B (%), This requires 


=a). (13.82) 


If B(a) vanishes at a = a* then BT(a) must too. Thus \?/a* must 
be a zero of B(a). If we assume that there is only one zero, then we 
must take \ = a*. 

dR 
da 


The transformation of would be 


dR dR a* dR 
— =e, 13. 
da d(Ž) A2 da ( 3.83) 


Note that this is the same transformation rule as for B above. Thus, 
the NLS scheme-fixing condition, IR = B(a), transforms into itself. 
It is straightforward to check that the same is true of the invariants 
master equation (13.17). It thus seems that an a > a*?/a duality is 
not special to the BZ limit, but is a general property of all-orders 


NLS and hence of all-orders OPT. 


13.9. Conclusions 


While BZ results are most simply obtained in a restrictive class of 
“regular” schemes, the same results emerge from “irregular” schemes, 
though they then require consideration of all orders of perturbation 
theory. Results in OPT for the fixed-point value R* are never far 
from the BZ result and converge quite nicely to it. The error at 
(k+1)th order shrinks as In? k/k? in NLS, as ln k/k? in PWMR, and 
probably slightly faster in true OPT. These results provide some 
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insight into how the subtle features of OPT conspire to improve 
finite-order results. 

Of course, in the BZ limit the EC scheme, or any “regular” 
scheme, is clearly better than OPT; their results converge immediately 
to the right result. The BZ limit is a case where we have an extra 
piece of information — and the general principle that we should 
make use of all available information takes precedence here over the 
Principle of Minimal Sensitivity. One should keep in mind, though, 
that the BZ limit, nf — 164, is not a physical one. It is an open 
question whether or not OPT gives better results than the EC scheme 
for nf = 16, the closest physical case. 

The situation with the critical exponent y* is much less satisfac- 
tory. While the all-orders NLS formulas produce the correct result, 
the finite-order NLS and PWMR results do not actually converge. 
In true OPT the results might converge but, if so, the convergence is 
extremely slow. The problem may stem from trying to obtain y* as a 
by-product of the optimization of R*. If one is principally interested 
in y* itself, then one should construct its own perturbation series 
and optimize that. However, our reason here for studying y* was 
not for its own sake, but as a shortcut to obtaining R(Q) at non- 
zero Q, relying on Eq. (13.10), which holds for the first three orders 
of the BZ expansion. That was very convenient because we only 
needed the optimization conditions at the fixed point, and these are 
analytically much simpler than for general Q. However, the natural 
procedure is to optimize R(Q) itself. There is no reason to suppose 
that the convergence of OPT for R(Q) at non-zero Q is significantly 
worse than for R*; indeed, as Q gets larger we expect convergence 
to become much better. Thus, our difficulties with y* are probably 
a technical, mathematical issue, rather than a problem of physical 
concern. 

A key result is the “fixed-point OPT master equation” (13.18) 
which opens a route to an analytical treatment of arbitrarily high 
orders of OPT, given knowledge of the p; invariants — although 
here we have only been able to make progress in two simplifying 
approximations, NLS and PWMR. It appears that the simple NLS 
approximation does yield the all-orders limit of OPT, although it 
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is a poor guide to the rate of approach to that limit. The NLS 
formulas, (13.33), (13.34) at leading order in the BZ expansion, 
and (13.74), (13.75) at next-to-leading order, are remarkably simple. 
They illustrate a general a + a*? /a duality property of all-orders 
OPT that is intriguing and deserves further study. 


Appendix 13.A: BZ Expansion at General Q 


We first examine the BZ limit. Here, the first two terms of the 6 
function dominate, so we may use the int-6 equation in the second- 
order form 


ca 


bIn(u/A) = —+ el 
n(u/A) = = +eln| | 


(13A.1) 


Combining this equation with the definition of p; (Q) yields the latter 
in terms of a. Recalling that in the BZ limit c ~ —1/ap, R* ~ a* ~ 
ao, and R ~ a, the limiting form is 


pP1(Q) = 7 + in (=). (13A.2) 


Inverting this equation (numerically) gives R as a function of 
p:(Q) = bln(Q/AR), and hence as a function of Q. This is the 
function sketched in Fig. 13.2. It is universal, at least within the 
class of “primary” physical quantities. 

To go beyond this limiting form and develop a systematic BZ 
expansion for R at a general Q is not a completely unambiguous 
matter. R(Q) is not expressible as a simple power series in ag, so 
some thought is required in deciding how precisely to define the nth- 
order approximant. The important point, as with any approximation, 
is to reconcile and make best use of all available information. Simply 
taking the int-G equation and then expanding it in powers of ag 
produces correction terms with (a9 —R) denominators. Higher orders 
bring in ever more singular terms. However, these terms simply arise 
from an expansion of In(R* — R), reflecting the fact that the fixed 
point R* does not stay at ao, but is itself a series in aj. Therefore, 
it is sensible to organize the expansion to reflect this fact. 
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Thus, before performing the integration of 1/8(«), where B(x) = 
B(a)/b, one should first rewrite it as 


1 —a* 


ET Ee eas ee Ye aT a ey 13A. 
—x?(1 + cx + cox? + ++) x*(a* — £)P(x)’ ( 3 3) 


ensuring that the pole is in the right place, and then express it in 
partial fractions as 


ee Se ee (134.4) 


The coefficients of the first three terms are determined by the x — 0 
and x — a* limits. Hence 4* is y*/b, where y* is the slope of the 8 
function at the fixed point. The remainder term can be expanded as 
a power series, H(x) = Ho + Hix +- 

In nth order of the BZ expansion one may truncate the 8 function 
after n+1 terms. In that case H(z) is initially of the form Q(x)/P(z), 
where P(x) and Q(x) are polynomials of degree n — 1 and n — 2, 
respectively. (For n = 1, Q(x) vanishes.) The coefficients of P(x) are 
of order unity as ag — 0. The coefficients of Q(x) are of order ag 
because of cancellations that make both c + 1/4* and a*/4* — 1 of 
order ao. For instance, Hp = ag(c4,-1 + 2c2,-1¢3,-1 + B) + O(a?). 
In the BZ expansion to nth order (n > 4) one needs coefficients up 
to H,—4: for the first three orders one can drop H(x) altogether. 

It is now straightforward to integrate 1/Ê(x) in the form of 
Eq. (13A.4) and hence obtain p,(Q) in terms of a and a*. One 
may then eliminate a and a* in favour of R and R*, working to 
the appropriate order. The last step can be short circuited by noting 
that the final result must be RS invariant, and so, without loss of 
generality, one may choose to work in the EC scheme in which R = a. 
Thus, the result to nth order in the BZ expansion can be expressed as 


1 1 R SHER 


(13.5) 
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where R*( and 4*(™ are the nth-order approximations to R* and 
4, respectively. For small R (i.e., at large Q) this formula will agree 
with (n + 1)th-order perturbation theory to the appropriate order in 
R and apo. 


Appendix 13.B: Pinch Mechanism Infrared Limit 


As discussed in Sec. 11.6, a finite infrared limit in OPT can occur 
through a pinch mechanism: The evolving B(a) function of the 
optimized scheme develops a minimum that “pinches” the horizontal 
axis at a “pinch point” ap, which ultimately becomes a double zero of 
B(a). The infrared limit of the couplant, however, is at an “unfixed 
point” a* > ap that is not a zero of the 6 function. The approach to 
Q =0 involves 1/(In Q)?, and so is characterized by y* = 0. 

In the BZ limit, ns — 164, the pinch mechanism does not seem 
to occur in true OPT, at least as far as we have been able to explore 
it in Sec. 13.3. However, the mechanism is very close to being relevant 
because in the BZ limit the critical exponent 7y* ~ bao tends to zero. 
A small or zero y* gives rise to a sharp infrared “spike” in R plotted 
versus Q, as in Fig. 13.2 or Fig. 11.4 

The NLS and PWMR approximations to OPT seem to have fixed 
points only in every other order (for odd k in NLS, and even k in 
PWMR). In these orders, as discussed in Sec. 13.5, the B(u) function 
closely approximates its limiting form (1 — u)/(1 + u)? until u gets 
close to 1, when it suddenly dives down, producing a zero. In the 
alternating orders B(u) suddenly shoots upwards and there is no 
zero. However, B(u) then has a minimum very close to the horizontal 
axis, so only a slight modification of the scheme would produce a 
“pinch point.” 

We first show that, in circumstances where the pinch mechanism 
does govern the infrared limit of OPT, the master equation that 
replaces Eq. (13.18) is 


Ge Taal) POG ae eal) 
(13B.1) 
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(Superscripts “(k + 1)” on R and B(a) are omitted for brevity.) 
Except for the prefactor, and the fact that ap (not a*) replaces a* in 
the last term, this equation is identical to (13.18). 

The derivation starts from Eq. (11.33) for the sm coefficients in 
infrared limit and uses a dummy variable a to form the function 


Sla) == = yee (13B.2) 


Reorganizing the resulting double summation over m and j so that 
the latter becomes the outer summation, the inner summations 
become finite geometric series or derivatives thereof. The outer 7 
summation then produces terms that are B(a) or dB/da or B(ap) 
or dB/da|,_,,.- The last two vanish in the infrared limit since ap is 
then a double zero of the B(a) function. After some further algebraic 
tidying up the result reduces to Eq. (13B.1) above. 

Note that the naive large-k limit of Eq. (13B.1) is not the NLS 
condition (13.26) but 


E = (2) B(a) (NLS’). (13B.3) 


If we proceed in parallel with the analysis in Sec. 13.5 we find, instead 


of Eq. (13.31), 
dv f du [1 —u/ur* 
|== S S (13B.4) 


Note that the above equations correspond to the ansatz form used 
in the PWMR analysis of Sec. 13.6 with € replaced by 


1 — u/u* 
£> A (13B.5) 


Doing the integrations, exponentiating both sides, and solving for v 
leads to 


4U 
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where 


1l—u/u* 2 — 
v= (>) Tua 1 (Za) væ 


u* — Up —— +1 Vu* + ,/Up 


(13B.7) 


Note that when u > up (which is relevant since u ranges from 0 to 
u*, which must exceed up) this formula for U develops an imaginary 
part. However, recall that both v and B, 


1=U) U jl— wo, 
B= e er) 


(cf. Eq. (13.52)), have to be expanded as series in u and then 
truncated after k terms, making them inevitably real. 

These formulas are hard to deal with, even at low orders, 
especially since up and u* have to be determined by the requirements 
that the truncated B and its derivative vanish at the pinch point up. 
For k = 2,4 there does not seem to be any viable solution, but for 
sufficiently large k it appears there is. Anticipating that both up and 
u* will tend to 1 as k > o, we define 


(13B.9) 


and proceed to expand to lowest non-trivial order in 6. This gives 


Uxu (1 — ° in(d — W), (13B.10) 
o 4u tz) aii 
VR pay eray l) (13B.11) 
and 
a LSU ô u (1 — 4u +u?) 
saa aea a t-4): 


(13B.12) 
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Table 13.7. NLS’ results, to lowest-order in 6, in 
the BZ limit. 


k Up u* ô v= 


100 0.95018 0.97735 0.02925 0.71485 
600 0.98819 0.99292 0.00482 0.95982 
10,000 0.99895 0.99924 0.00029 0.99856 


Remarkably, one can find analytic expressions for the truncated- 
series versions of v and B and thereby explore numerical results up to 
very high k values. These results (see Table 13.7) show that indeed 
there is a valid solution (with u* > up) with ô tending to zero as 
ô ~ (2/1n2)(1/k) and R*/ao tending to 1. 
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Chapter 11: This chapter is based largely on [18], which builds 
upon [11, 14, 15]. For fixed point lore see the reviews cited under the 
“Renormalization Group” heading above, or any modern quantum- 
field-theory textbook. 

The ideas referred to in the third paragraph are exemplified 
by Bloom—Gilman duality [89] and Poggio-Quinn—Weinberg (PQW) 
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